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PKEFACB 

An  American  student  approaching  the  higher  parts  of  mathe- 
matics usually  finds  himself  unfamiliar  with  most  of  the  main  facts 
of  algebra,  to  say  nothing  of  their  proofs.  Thus  he  has  only  a 
rudimentary  knowledge  of  systems  of  linear  equations,  and  he  knows 
next  to  nothing  about  the  subject  of  quadratic  forms.  Students  in 
this  condition,  if  they  receive  any  algebraic  instruction  at  all,  are 
usually  plunged  into  the  detailed  study  of  some  special  branch  of 
algebra,  such  as  the  theory  of  equations  or  the  theory  of  invariants, 
where  their  lack  of  real  mastery  of  algebraic  principles  makes  it 
almost  inevitable  that  the  work  done  should  degenerate  to  the  level 
of  purely  formal  manipulations.  It  is  the  object  of  the  present 
book  to  introduce  the  student  to  higher  algebra  in  such  a  way  that 
he  shall,  on  the  one  hand,  learn  what  is  meant  by  a  proof  in  algebra 
and  acquaint  himself  with  the  proofs  of  the  most  fundamental  facts, 
and,  on  the  other,  become  familiar  with  many  important  results  of 
algebra  which  are  new  to  him. 

The  book  being  thus  intended,  not  as  a  compendium,  but  really, 
as  its  title  states,  only  as  an  introduction  to  higher  algebra,  the 
attempt  has  been  made  throughout  to  lay  a  sufficiently  broad  founda- 
tion to  enable  the  reader  to  pursue  his  further  studies  intelligently, 
rather  than  to  carry  any  single  topic  to  logical  completeness.  No 
apology  seems  necessary  for  the  omission  of  even  such  important 
subjects  as  Galois's  Theory  and  a  systematic  treatment  of  invariants. 
A  selection  being  necessary,  those  subjects  have  been  chosen  for 
treatment  which  have  proved  themselves  of  greatest  importance  in 
geometry  and  analysis,  as  well  as  in  algebra,  and  the  relations  of 
the  algebraic  theories  to  geometry  have  been  emphasized  throughout. 
At  the  same  time  it  must  be  borne  in  mind  that  the  subject  primarily 
treated  is  algebra,  not  analytic  geometry,  so  that  such  geometric 
information  as  is  given  is  necessarily  of  a  fragmentary  and  some- 
what accidental  character. 

No  algebraic  knowledge  is  presupposed  beyond  a  familiarity  with 
elementary  algebra  up  to  and  including  quadratic  equations,  and 
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such  a  knowledge  of  determinants  and  the  method  of  mathematical 
induction  as  may  easily  be  acquired  by  a  freshman  in  a  week  or 
two.  Nevertheless,  the  book  is  not  intended  for  wholly  immature 
readers,  but  rather  for  students  who  have  had  two  or  three  years' 
training  in  the  elements  of  higher  mathematics,  particularly  in 
analytic  geometry  and  the  calculus.  In  fact,  a  good  elementary 
knowledge  of  analytic  geometry  is  indispensable. 

The  exercises  at  the  ends  of  the  sections  form  an  essential  part 
of  the  book,  not  merely  in  giving  the  reader  an  opportunity  to  think 
for  himself  on  the  subjects  treated,  but  also,  in  many  cases,  by  sup- 
plying him  with  at  least  the  outlines  of  important  additional  theories. 
As  illustrations  of  this  we  may  mention  Sylvester's  Law  of  Nullity 
(page  80),  orthogonal  transformations  (page  154  and  page  173),  and 
the  theory  of  the  invariants  of  the  biquadratic  binary  form  (page  260). 

On  a  first  reading  of  Chapters  I-VII,  it  may  be  found  desirable 
to  omit  some  or  all  of  sections  10,  11,  18, 19,  20,  25,  27,  34,  35.  The 
reader  may  then  either  take  up  the  subject  of  quadratic  forms 
(Chapters  VIII-XIII),  or,  if  he  prefer,  he  may  pass  directly  to  the 
more  general  questions  treated  in  Chapters  XIV-XIX. 

The  chapters  on  Elementary  Divisors  (XX-XXII)  form  decid- 
edly the  most  advanced  and  special  portion  of  the  book.  A  person 
wishing  to  read  them  without  reading  the  rest  of  the  book  should 
first  acquaint  himself  with  the  contents  of  sections  19  (omitting 
Theorem  1),  21-25,  36,  42,  43. 

In  a  work  of  this  kind,  it  has  not  seemed  advisable  to  give  many 
bibliographical  references,  nor  would  an  acknowledgement  at  this 
point  of  the  sources  from  which  the  material  has  been  taken  be 
feasible.  The  work  of  two  mathematicians,  however,  Kronecker 
and  Frobenius,  has  been  of  such  decisive  influence  on  the  character 
of  the  book  that  it  is  fitting  that  their  names  receive  special  men- 
tion here.  The  author  would  also  acknowledge  his  indebtedness 
to  his  colleague.  Professor  Osgood,  for  suggestions  and  criticisms 
relating  to  Chapters  XIV-XVI. 

This  book  has  grown  out  of  courses  of  lectures  which  have  been 
delivered  by  the  author  at  Harvard  University  during  the  last  ten 
years.  His  thanks  are  due  to  Mr.  Duval,  one  of  his  former  pupils, 
without  whose  assistance  the  book  would  probably  never  have  been 
written. 
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CHAPTER   I 

POLYNOMIALS   AND  THEIR   MOST   FUNDAMENTAL 

PROPERTIES 

1.  Polynomials  in  One  Variable.  By  an  integral  rational  func- 
tion of  x^  or,  as  we  shall  say  for  brevity,  a  polynomial  in  a;,  is  meant 
a  function  of  x  determined  by  an  expression  of  the  form 

(1)  C^X°-^ -\-  Cj^X°-^ -\-   "•    4-  <?A;^% 

where  the  as  are' integers  positive  or  zero,  while  the  c's  are  any  con- 
stants, real  or  imaginary.  We  may  without  loss  of  generality 
assume  that  no  two  of  the  as  are  equal.  This  being  the  case,  the 
expressions  CiX^-t  are  called  the  terms  of  the  polynomial,  c^  is  called 
the  coefficient  of  this  term,  and  a^  is  called  its  degree.  The  highest 
degree  of  any  term  whose  coefficient  is  not  zero  is  called  the  degree 
of  the  polynomial. 

It  should  be  noticed  that  the  conceptions  just  defined  —  terms, 
coefficients,  degree  —  apply  not  to  the  polynomial  itself,  but  to  the 
particular  expression  (1)  which  we  use  to  determine  the  polynomial, 
and  it  would  be  quite  conceivable  that  one  and  the  same  function  of 
X  might  be  given  by  either  one  of  two  wholly  different  expressions 
of  the  form  (1).  We  shall  presently  see  (cf.  Theorem  5  below) 
that  this  cannot  be  the  case  except  for  the  obvious  fact  that  we 
may  insert  in  or  remove  from  (1)  any  terms  we  please  with  zero 
coefficients. 

By  arranging  the  terms  in  (1)  in  the  order  of  decreasing  a's  and 
supplying,  if  necessary,  certain  missing  terms  with  zero  coefficients, 
we  may  write  the  polynomial  in  the  normal  form 

(2)  a^x""  +  a-^x""-^ -f  •••  +  a,,_iX  +  a^^ 

B  1 
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It  should,  however,  constantly  be  borne  in  mind  that  a  polynomial 
in  this  form  is  not  necessarily  of  the  nth  degree ;  but  will  be  of  the 
nth  degree  when  and  only  when  a^  ^  0. 

Definition.  Two  polynomials^  /i(^)  <^^^  /2(^)'  ^^^  ^^**^  ^^  ^^ 
identically/  equal  (/^  =/2)  if  thei/  are  equal  for  all  values  of  x.  A 
polynomial  f{x)  is  said  to  vanish  identically  (f=0)  if  it  vanishes  for 
all  values  of  x. 

We  learn  in  elementary  algebra  how  to  add,  subtract,  and  multi- 
ply *  polynomials ;  that  is,  when  two  polynomials  f^ix)  and  f,^{x)  are 
given,  to  form  new  polynomials  equal  to  the  sum,  difference,  and 
product  of  these  two.  ^ 

Theorem  1.     If  the  polynomial 

f{x)  =  a^x""  +  a-^x'"''^  -h  •••  +  a„ 

vanishes  when  x  =  a^  there  exists  another  polynomial 

such  that  f(x)  =  {x—  a)^^{x). 

For  since  by  hypothesis /( a)  =  0,  we  have 
f{x)  =f(x)  -f(a)  =  a^ix^  -  a-)  +  a^(x^-'  -  a^-')  +  -  +  a^.,(x  -  a). 

Now  by  the  rule  of  elementary  algebra  for  multiplying  together 
two  polynomials  we  have 

x^  —  a^={x—  a)  (x^~^  +  aa;*"^  +  •  •  •  +  «*"^). 
Hence 

f{x)  =  (x-a)  [ao(^""^  +  a^"""^  +  •••  +  a"~0  +  a^ix""-^  +  ax""-^  +  —     ^ 

If  we  take  as  (t>i{x)  the  polynomial  in  brackets,  our  theorem  is 
proved. 

Suppose  now  that  /3  is  another  value  of  x  distinct  from  a  for  which 
fix)  is  zero.     Then 

*  The  question  of  division  is  somewhat  more  complicated  and  will  be  considered 
in  §63. 
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and  since  yS  — a^O,  <^j(/3)  =  0.  We  can  therefore  apply  the  theo- 
rem just  proved  to  the  polynomial  (l>i(x)^  thus  getting  a  new 
polynomial  ^^(^)^«^^»-a+ „.^«-3  +  ...  +„._^ 

such  that  <^^(x)  =  {x—  ^)(^^{x\ 

and  therefore  f{x)  =  {x—  a){x  —  ^)<^^{x). 

Proceeding  in  this  way,  we  get  the  following  general  result : 
Theorem  2.     If  a^t  «2^  •••  ot,^  are  k  distinct  constants,  and  if 

f(x)  =  a^x''  +  a^x""-^  4-  ...  +  a^  {n>Jc), 

and      •  f(a^)  =f{a^  =  •••  =f{ak)  =  0, 

then  f{x)  =  (x  —  aj)(x— a2) -"(x  —  a/^)(f)(x), 

where  <^{x)  =  a^x^'^  +  h^x'^~^~'^  -}-...+  5^_^. 

Applying  this  theorem  in  particular  to  the  case  n  =  A:,  we  see  that 
if  the  polynomial  f{x)  vanishes. for  n  distinct  values  Wj,  Wg^  •••  a^  of  x, 

fix)  =  a^{x  -  a^){x  -a2)"'(x-  «„). 

Accordingly,  if  a^^O,  there  can  be  no  value  of  x  other  than  a^,  •••  a„ 
for  which /(a;)  =  0.     We  have  thus  proved 

Theorem  3.  A  polynomial  of  the  nth  degree  in  x  cannot  vanish 
for  more  than  n  distinct  values  of  x. 

Since  the  only  polynomials  which  have  no  degree  are  those  all  of 
whose  coefficients  are  zero,  and  since  such  polynomials  obviously 
vanish  identically,  we  get  the  fundamental  result : 

Theorem  4.  A  necessary  and  sufficient  condition  that  a  polyno- 
mial in  X  vanish  identically  is  that  all  its  coefficients  he  zero. 

Since  two  polynomials  in  x  are  identically  equal  when  and  only 
when  their  difference  vanishes  identically,  we  have 

Theorem  5.  A  necessary  and  sufficient  condition  that  two  polyno- 
mials in  X  he  identically  equal  is  that  they  have  the  same  coefficients. 

This  theorem  shows,  as  was  said  above,  that  the  terms,  coefficients, 
and  degree  of  a  polynomial  depend  merely  on  the  polynomial  itself, 
not  on  the  special  way  in  which  it  is  expressed. 
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2.  Polynomials  in  More  than  One  Variable.  A  function  of  (a:,  y) 
is  called  a  polynomial  if  it  is  given  by  an  expression  of  the  form 

where  the  a's  and  yS's  are  integers  positive  or  zero. 

More  generally,  a  function  of  {x-^^  x^^  •••  x^  is  called  a  polynomial 
if  it  is  determined  by  an  expression  of  the  form 

(1)  C^X-^^^X^^  •••  X/^  4-  C^X^-^X^-^  '"  Xn^+    •••   +  Cj^X^°-kX^k  ...  O^/ft, 

where  the  a's,  /8's,  •••  z^'s  are  integers  positive  or  zero. 

Here  we  may  assume  without  loss  of  generality  that  in  no  two 
terms  are  the  exponents  of  the  various  a^'s  the  same  ;  that  is,  that  if 

"i  =  ^j'>  A  =  /%»  •  •  •  /^i  =  t^p 
then  Vi^Vy 

This  assumption  being  made,  CiX-^'^ix^^i---  x^i  is  called  a  term  of  the 
polynomial,  Ci  its  coefficient^  a^  the  degree  of  the  term  in  x-^^  /3^  in  a^g, 
etc.,  and  a^  + /8^+ •••  +  i^i  the  total  degree^  or  simply  the  degree,  of 
the  term.  The  liighest  degree  in  x^  of  any  term  in  the  polynomial 
whose  coefficient  is  not  zero  is  called  the  degree  of  the  polynomial 
in  Xi,  and  the  highest  total  degree  of  any  term  whose  coefficient  is 
not  zero  is  called  the  degree  of  the  polynomial. 

Here,  as  in  §  1,  the  conceptions  just  defined  apply  for  the  present 
not  to  the  function  itself  but  to  the  special  method  of  representing 
it  by  an  expression  of  the  form  (1).  We  shall  see  presently,  how- 
ever, that  this  method  is  unique. 

Before  going  farther,  we  note  explicitl}^  that  according  to  the 
definition  we  have  given,  a  polynomial  all  of  whose  coefficients  are  zero 
has  no  degree. 

When  we  speak  of  a  polynomial  in  n  variables,  we  do  not  nec- 
essarily mean  that  all  n  variables  are  actually  present.  One  or  more 
of  them  may  have  the  exponent  zero  in  every  term,  and  hence  not 
appear  at  all.  Thus  a  polynomial  in  one  variable,  or  even  a  con- 
stant, may  be  regarded  as  a  special  case  of  a  polynomial  in  any 
larger  number  of  variables. 

A  polynomial  all  of  whose  terms  are  of  the  same  degree  is  said 
to  be  homogeneous.     Such  polynomials  we  will  speak  of  as  forms,* 

*  There  is  diversity  of  usage  here.  Some  writers,  following  Kronecker,  apply  the 
term  form  to  all  polynomials.  On  the  other  hand,  homogeneous  polynomials  are  often 
spoken  of  as  quantics  by  English  writers. 
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distinguishing  between  binary^  ternary^  quaternary^  and  in  general, 
n-ary  forms  according  to  the  number  of  variables  involved,  binary 
forms  involving  two,  ternary  three,  etc. 

Another  method  of  classifying  forms  is  according  to  their 
degree.  We  speak  here  of  linear  forms,  quadratic  forms,  cubic 
forms,  etc.,  according  as  the  degree  is  1,  2,  3,  etc.  We  will,  how- 
ever, agree  that  a  polynomial  all  of  whose  coefficients  are  zero  may 
also  be  spoken  of  indifferently  as  a  linear  form,  quadratic  form, 
cubic  form,  etc.,  in  spite  of  the  fact  that  it  has  no  degree. 

If  all  the  coefficients  of  a  polynomial  are  real,  it  is  called  a  real 
polynomial  even  though,  in  the  course  of  our  work,  we  attribute 
imaginary  values  to  the  variables. 

It  is  frequently  convenient  to  have  a  polynomial  in  more  than  one 
variable  arranged  according  to  the  descending  powers  of  some  one 
of  the  variables.  Thus  a  normal  form  in  which  we  may  write  a 
polynomial  in  n  variables  is 

9o\'^2'  ***  *^«/*^i    ~^"  9iv'^2'  **'  *^«)-^i       4-  •••  4- <f>TOV'^2'  ***  *^«)' 
the  <^'s  being  polynomials  in  the  n  —  1  variables  {x^^  ••.  x^)- 

We  learn  in  elementary  algebra  how  to  add,  subtract,  and  multiply 
polynomials,  getting  as  the  result  new  polynomials. 

Definition.  Two  polynomials  in  any  number  of  variables  are 
said  to  be  identically  equal  if  tJiey  are  equal  for  all  values  of  the  vari- 
ables. A  polynomial  is  said  to  vanish  identically  if  it  variishes  for 
all  values  of  the  variables. 

Theorem  1.  A  necessary  and  sufficient  condition  that  a  polyno- 
mial in  any  number  of  variables  vanish  identically  is  that  all  its  coeffi- 
cients be  zero. 

That  this  is  a  sufficient  condition  is  at  once  obvious.  To  prove 
that  it  is  a  necessary  condition  we  use  the  method  of  mathematical 
induction.  Since  we  know  that  the  theorem  is  true  in  the  case  of 
one  variable  (Theorem  4,  §  1),  the  theorem  will  be  completely  proved 
if  we  can  show  that  if  it  is  true  for  a  certain  number  n  —  1  oi  vari- 
ables, it  is  true  for  n  variables. 

Suppose,  then,  that 

vanishes  identically.  If  we  assign  to  (372,  •••  a7„)  any  fixed  values 
(^2?  •  •  •  ^'n)^  f  becomes  a  polynomial  in  xi  alone,  which,  by  hypothesis. 


6  INTRODUCTION  TO  HIGHER  ALGEBRA 

vanishes  for  all  values  of  Xy  Hence  its  coefficients  must,  by  Theorem 
4,  §  1,  all  be  zero :  ,  /•     a  i  x 

That  is,  the  polynomials  0q,  (/>j,  •••  <^^  vanish  for  all  values  of  the 
variables,  since  (0:2,  •••  x'^)  was  ani/  set  of  values.  Accordingly,  by 
the  assumption  we  have  made  that  our  theorem  is  true  for  polyno- 
mials in  ^  —  1  variables,  all  the  coefficients  of  all  the  polynomials 
<^Q,  <^j,  •••  (f)^  are  zero.  These,  however,  are  simply  the  coefficients 
of/.     Thus  our  theorem  is  proved. 

Since  two  polynomials  are  identically  equal  when  and  only 
when  their  difference  is  identically  zero,  we  infer  now  at  once  the 
further  theorem : 

Theorem  2.  A  necessary  and  sufficient  condition  that  two  poly- 
nomials he  identically  equal  is  that  the  coefficients  of  their  correspo7iding 
terms  he  equal. 

We  come  next  to 

Theorem  3.  Iffi  andf2  are  polynomials  in  any  numher  of  vari- 
ahles  of  degrees  m^  and  m2  respectively,,  the  product  f^f 2  will  he  of  de- 
gree Wj  +  ^2- 

This  theorem  is  obviously  true  in  the  case  of  polynomials  in  one 
variable.  If,  then,  assuming  it  to  be  true  for  polynomials  in  ^  —  1 
variables  we  can  prove  it  to  be  true  for  polynomials  in  n  variables, 
the  proof  of  our  theorem  by  the  method  of  mathematical  induction 
will  be  complete. 

Let  us  look  first  at  the  special  case  in  which  both  polynomials 
are  homogeneous.  Here  every  term  we  get  by  multiplying  them 
together  by  the  method  of  elementary  algebra  is  of  degree  m^  +  m^. 
Our  theorem  will  therefore  be  proved  if  we  can  show  that  there  is  at 
least  one  term  in  the  product  whose  coefficient  is  not  zero.  For 
this  purpose,  let  us  arrange  the  two  polynomials  /^  and  f,^  according 
to  descending  powers  of  x^, 

Here  we  may  assume  that  neither  <^5  nor  <^o'  vanishes  identically. 
Since /i  and/g  are  homogeneous,  </)o  and  </)o'  will  also  be  homogeneous 
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of  degrees  m^  —  k^  and  m^  —  h^  respectively.  In  the  product /^/j  the 
terms  of  highest  degree  in  x-^  will  be  those  in  the  product 

and  since  we  assume  our  theorem  to  hold  for  polynomials  in  n  —  1 
variables,  <^o  </)o'  will  be  a  polynomial  of  degree  m-^-{-m^  —  h-^  —  h^. 
Any  term  in  this  product  whose  coefficient  is  not  zero  gives  us  when 
multiplied  by  x-^-^^^^  a  term  of  the  product /^/g  ^^  degree  m^  +  m^ 
whose  coefficient  is  not  zero.  Thus  our  theorem  is  proved  for  the 
case  of  homogeneous  polynomials. 

Let  us  now,  in  the  general  case,  write /^  and/2  in  the  forms 

J'zi^l')  ' '  •  ^n)  —  9w2(^li  * ' '  ^n)  +  Ym2-l\^l'>  ' ' '  ^n)  +  *  *  'l 

where  <^/  and  <^/  are  homogeneous  polynomials  which  are  either  of 
degrees  i  and  j  respectively,  or  which  vanish  identically.  Since, 
by  hypothesis,  /j  and  f^  are  of  degrees  mi  and  W2  respectively, 
04^  and  </)^^  will  not  vanisli  identically,  but  will  be  of  degrees 
THi  and  9712- 

The  terms  of  highest  degree  in  the  product  /1/2  will  therefore  be 
the  terms  of  the  product  cj)',,,^  4>'J^^,  and  this  being  a  product  of  homo- 
geneous polynomials  comes  under  the  case  just  treated  and  is  there- 
fore of  degree  mi  -\-  ^2-  The  same  is  therefore  true  of  the  product 
/1/2,  and  our  theorem  is  proved. 

By  a  successive  application  of  this  theorem  we  infer 

Corollary.  If  k  polynomials  are  of  degrees  mj,  mg,  •••  Wj.  re- 
spectively/^ their  product  is  of  degree  m^  +  m^-^  ••-  -\-  m^. 

We  mention  further,  on  account  of  their  great  importance,  the 
two  rather  obvious  results  : 

Theorem  4.  If  the  product  of  two  or  more  polynomials  is  identi- 
cally zero,  at  least  one  of  the  factors  must  he  identically  zero. 

For  if  none  of  them  were  identically  zero,  they  would  all  have 
definite  degrees,  and  therefore  their  product  would,  by  Theorem  3, 
have  a  definite  degree,  and  would  therefore  not  vanish  identically. 

It  is  from  this  theorem  that  we  draw  our  justification  for  cancel- 
ling out  from  an  identity  a  factor  which  we  know  to  be  not  identi- 
cally zero. 
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Theorem  5.  If  f{xy^  •  •  •  x^  is  a  polynomial  which  is  not  identically 
zero,  and  if  <\){x-^^  •••  a;^)  vanishes  at  all  points  where  f  does  not  vanish^ 
then  (j)  vanishes  identically. 

This  follows  from  Theorem  4  when  we  notice  that  /(/>  =0. 

EXERCISES 

1.  If  /  and  0  are  polynomials  in  any  number  of  variables,  what  can  be  inferred 
from  the  identity  /^  =  <^2  concerning  the  relation  between  the  polynomials  /  and  ^  .■' 

2.  If  /i  and  f^  are  polynomials  in  (a:j,  •••  Xn)  which  are  of  degrees  m^  and  i>i., 
respectively  in  xi,  prove  that  their  product  is  of  degree  m^  +  m^  in  Xy 

3.  Geometric  Interpretations.  In  dealing  with  functions  of  a 
single  real  variable,  the  different  values  which  the  variable  may 
take  on  may  be  represented  geometrically  by  the  points  of  a  line  ; 
it  being  understood  that  when  we  speak  of  a  point  x  we  mean  the 
point  which  is  situated  on  the  line  at  a  distance  of  x  units  (to  the 
right  or  left  according  as  x  is  positive  or  negative)  from  a  certain 
fixed  origin  0,  on  the  line.  Similarly,  in  the  case  of  functions  of 
two  real  variables,  the  sets  of  values  of  the  variables  may  be  pictured 
geometrically  by  the  points  of  a  plane,  and  in  the  case  of  three  real 
variables,  by  the  points  of  space  ;  the  set  of  values  represented  by  a 
point  being,  in  each  case,  the  rectangular  coordinates  of  that  point. 
When  we  come  to  functions  of  four  or  more  variables,  however,  this 
geometric  representation  is  impossible. 

The  complex  variable  x  =  ^  -\-r]i  depends  on  the  two  independent 
real  variables  |  and  r)  in  such  a  way  that  to  every  pair  of  real  values 
(f ,  7))  there  corresponds  one  and  only  one  value  of  x.  The  different 
values  which  a  single  complex  variable  may  take  on  may,  therefore, 
be  represented  by  the  points  of  a  plane  in  which  (f,  t))  are  used  as 
cartesian  coordinates.  In  dealing  with  functions  of  more  than  one 
complex  variable,  however,  this  geometric  representation  is  impos- 
sible, since  even  two  complex  variables  a:  =  J^-?;^,  y  =  ^^-{-Tj^i  are 
equivalent  to  four  real  variables  (|,  ?;,  |p  r]^). 

By  the  neighborhood  of  a  point  a;  =  a  we  mean  that  part  of  the 
line  between  the  points  x  =  a  —  a  and  x=  a-\-  a  {a  being  an  arbitrary 
positive  constant,  large  or  small),  or  what  is  the  same  thing,  all 
points  whose  coordinates  x  satisfy  the  inequality  |  a;  —  a  |  <  a.* 

*  We  use  the  symbol   |Z  |  to  denote  the  absolute  value  of  Z,  i.e.  the  numerical 
value  of  Z  if  Z  is  real,  the  modulus  of  Z  if  Z  is  imaginary. 
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Similarly,  by  the  neighborhood  of  a  point  (a,  b)  in  a  plane,  we 
shall  mean  all  points  whose  coordinates  (rr,  7/)  satisfy  the  inequalities 

where  «  and  y8  are  positive  constants.  This  neighborhood  thus  con- 
sists of  the  interior  of  a  rectangle  of  which  (a,  h)  is  the  center  and 
whose  sides  are  parallel  to  the  coordinate  axes. 

By  the  neighborhood  of  a  point  (a,  5,  c)  in  space  we  mean  all 
points  whose  coordinates  (x^  y^  z)  satisfy  the  inequalities 

\x—a\<a^     ly  — 5|</3,     |;2— c|<7. 

In  all  these  cases  it  will  be  noticed  that  the  neighborhood 
may  be  large  or  small  according  to  the  choice  of  the  constants 
a,  ^,  7. 

If  we  are  dealing  with  a  single  complex  variable  x  =  ^  -{-  rji^  we 
understand  by  the  neighborhood  of  a  point  a  all  points  in  the  plane 
of  complex  quantities  whose"  complex  coordinate  x  satisfies  the  in- 
equality \x  —  a\<a^  a  being  as  before  a  real  positive  constant.  Since 
|a?  —  a|  is  equal  to  the  distance  between  x  and  a,  the  neighborhood  of 
a  now  consists  of  the  interior  of  a  circle  of  radius  a  described  about 
a  as  center. 

It  is  found  convenient  to  extend  the  geometric  terminology 
we  have  here  introduced  to  the  case  of  any  number  of  real  or 
complex  variables.  Thus  if  we  are  dealing  with  n  independent 
variables  {x^^  x^^  •••  x^)-,  we  speak  of  any  particular  set  of  values 
of  these  variables  as  a  point  in  space  of  n  dimensions.  Here 
we  have  to  distinguish  between  real  points^  that  is  sets  of  values 
of  the  a;'s  which  are  all  real,  and  imaginary  points  in  which 
this  is  not  the  case.  In  using  these  terms  we  do  not  propose 
even  to  raise  the  question  whether  in  any  geometric  sense  there 
is  such  a  thing  as  space  of  more  than  three  dimensions.  We 
merely  use  these  terms  in  a  wholly  conventional  algebraic 
sense  because  on  the  one  hand  they  have  the  advantage  of 
conciseness  over  the  ordinary  algebraic  terms,  and  on  the  other 
hand,  by  calling  up  in  our  minds  the  geometric  pictures  of  three 
dimensions  or  less,  this  terminology  is  often  suggestive  of  new 
relations  which  might  otherwise  not  present  themselves  to  us  so 
readily. 
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By  the  neighborhood  of  the  point  (a^,  a^,  •••  a„)  we  understand  all 
points  which  satisfy  the  inequalities 


^1  ~"  ^ll  ^  "r        1^2  ~  ^2!  ^  ^2'  ***  l^»  —  ^n\<^: 


W) 


where  a^,  «2'  *'*  ^«  ^^'^  ^^^^  positive  constants. 

If,  in  particular,  {a^^  a^^  •••  a„)  is  a  real  point,  we  may  speak  of 
the  real  neighborhood  of  this  point,  meaning  thereby  all  real  points 
(iTj,  x^^  •••  Xn)  which  satisfy  the  above  inequalities. 

As  an  illustration  of  the  use  to  which  the  conception  of  the 
neighborhood  of  a  point  can  be  put  in  algebra,  we  will  prove  the 
following  important  theorem : 

Theorem  1.  A  necessary  and  sufficient  condition  that  a  poly- 
nomial f{x^^  '"  Xn)  vanish  identically  is  that  it  vanish  throughout  the 
neighborhood  of  a  point  (a^,  •••  a^). 

That  this  is  a  necessary  condition  is  obvious.  To  prove  that  it 
is  sufficient  we  begin  with  the  case  n  =  \. 

Suppose  then  that /(a;)  vanishes  throughout  a  certain  neighbor- 
hood of  the  point  x  =  a.  lif{x)  did  not  vanish  identically,  it  would 
be  of  some  definite  degree,  say  k^  and  therefore  could  not  vanish  at 
more  than  k  points  (cf.  Theorem  3,  §  1).  This,  however,  is  not  the 
case,  since  it  vanishes  at  an  infinite  number  of  points,  namely  all 
points  in  the  neighborhood  of  x  =  a.  Thus  our  theorem  is  proved 
in  the  case  n  =  \. 

Turning  now  to  the  case  71  =  2,  let 

fix,  y)  =  4>Q{y)x'  +  (t>^{y)x'-'+  ...  ■\- My) 

be  a  polynomial  which  vanishes  throughout  a  certain  neighborhood 
of  the  point  {a,  5),  say  when 


\x 


^l<"?     l3/~^|^/^* 


Let  yQ  be  any  constant  satisfying  the  inequality 

Then  f(x,  y^)  is  a  polynominal  in  x  alone  which  vanishes  whenever 
\x  —  a\<a.  Hence,  by  the  case  w  =  1  of  our  theorem,  f(x,  y^)  =  0. 
That  is. 
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Thus  all  these  polynomials  (f)  vanish  at  every  point  ^q  in  the  neigh- 
borhood of  «/  =  ^,  and  therefore,  by  the  case  n  =  l  oi  our  theorem, 
they  are  all  identically  zero.  From  this  it  follows  that  for  every 
value  of  X,  f{x^  y)  vanishes  for  all  values  of  y^  that  is  /=  0,  and 
our  theorem  is  proved. 

We  leave  to  the  reader  the  obvious  extension  of  this  method  of 
proof  to  the  case  of  n  variables  by  the  use  of  mathematical  induction. 

From  the  theorem  just  proved  we  can  infer  at  once  the  following: 

Theorem  2.  A  necessary  and  sufficient  condition  that  two  polyno- 
mials 171  the  variables  (x-^^  •  •  •  x^)  he  identically  equal  is  that  they  be 
equal  throughout  the  neighborhood  of  a  point  (a^,  •  •  •  a^). 

EXERCISES 

1.  Theorem  3,  §  1  may  be  stated  as  follows :  If  /  is  a  polynomial  in  one 
variable  which  is  known  not  to  be  of  degree  higher  than  n,  then  if  /  vanishes  at 
n  +  1  distinct  points,  it  vanishes  identically. 

EstabUsh  the  following  generalization  of  this  theorem  : 

If /is  a  polynomial  in  (x,  y)  which  is  known  not  to  be  of  higher  degree  than 
n  in  X,  and  not  of  higher  degree  than  m  in  y,  then,  if  /  vanishes  at  the 
(n  +  1)  (m  +  1)  distinct  points :  /  •  _  i    o  ,  i  \ 

(^■.»)  [iZit'ZlVii' 

it  vanishes  identically.  -^        '-?•••       + 

2.  Generalize  the  theorem  of  Exercise  1  to  polynomials  in  any  number  of 
variables. 

3.  Prove  Theorem  4,  §  2  by  means  of  Theorem  1  of  the  present  section  ;  and 
from  this  result  deduce  Theorem  3,  §  2. 

4.  Do  Theorems  1  and  2  of  this  section  hold  if  we  consider  only  real  polyno- 
mials and  the  real  neighborhoods  of  real  points  ? 

4.  Homogeneous  Coordinates.  Though  only  two  quantities  are 
necessary  in  order  to  locate  the  position  of  a  point  in  a  plane,  it  is 
frequently  more  convenient  to  use  three,  the  precise  values  of  the 
quantities  being  of  no  consequence,  but  only  their  ratios.  We  will 
represent  these  three  quantities  by  a;,  y,  t,  and  define  their  ratios  by 
the  equations  x      ^  V      ir 

where  X  and  Y  are  the  cartesian  coordinates  of  a  point  in  a  plane. 
Thus  (2,  3,  5)  will  represent  the  point  whose  abscissa  is  f  and  whose 
ordinate  is  |^.     Any  set  of  three  numbers  which  are  proportional  to 
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(2,  3,  5)  will  represent  the  same  point.  So  that,  while  to  every 
set  of  three  numbers  (with  certain  exceptions  to  be  noted  below) 
there  corresponds  one  and  only  one  point,  to  each  point  there  cor- 
respond an  infinite  number  of  different  sets  of  three  numbers,  all 
of  which,  however,  are  proportional. 

When  ^  =  0  our  definition  is  meaningless ;  but  if  we  consider 
the  points  (2,  3,  1),  (2,  3,  0.1),  (2,  3,  0.01),  (2,  3,  0.001),  •••,  which 
are,  in  cartesian  coordinates,  the  points  (2,  3),  (20,  30),  (200,  300), 
(2000,  3000),  •••,  we  see  that  they  all  lie  on  the  straight  line  through 
the  origin  whose  slope  is  |.  Thus  as  t  approaches  zero,  x  and  y 
remaining  fixed  but  not  both  zero,  the  point  (a:,  ^,  t)  moves  away 
along  a  straight  line  through  the  origin  whose  slope  is  y/x.  Hence 
it  is  natural  to  speak  of  (rr,  y^  0)  as  the  point  at  infiiiity  on  the  line 
whose  slope  is  y/x.  If  t  approaches  zero  through  negative  values, 
the  point  will  move  off  along  the  same  line,  but  in  the  opposite 
direction.  We  will  not  distinguish  between  these  two  cases,  but 
will  speak  of  only  one  point  at  infinity  on  any  particular  line.  It 
can  be  easily  verified  that  if  a  point  moves  to  infinity  along  any  line 
parallel  to  the  one  just  considered,  its  homogeneous  coordinates  may 
be  made  to  approach  the  same  values  (x^  «/,  0)  as  those  just  obtained. 
It  is  therefore  natural  to  speak  of  the  point  at  infinity  in  a  cer- 
tain direction  rather  than  on  a  definite  line.  Finally  we  will  agree 
that  two  points  at  infinity  whose  coordinates  are  proportional  shall 
be  regarded  as  coinciding,  since  these  coordinates  may  be  regarded 
as  the  limits  of  the  coordinates  of  one  and  the  same  point  which 
moves  further  and  further  off.* 

If  a;  =  J/  =  i  =  0,  we  will  not  say  that  we  have  a  point  at  all,  since 
the  coordinates  of  any  point  whatever  may  be  taken  as  small  as  we 
please,  and  so  (0,  0,  0)  might  be  regarded  as  the  limits  of  the  coor- 
dinates of  any  fixed  or  variable  point. 

*  It  should  be  noticed  that  in  speaking  of  points  at  infinity  we  are,  considering 
the  matter  from  a  purely  logical  point  of  view,  doing  exactly  the  same  thing  that  we 
did  in  §  3  in  speaking  of  imaginary  points,  or  points  in  space  of  n  dimensions ;  that  is, 
we  are  speaking  of  a  set  of  quantities  as  a  "point"  which  are  not  the  coordinates  of 
any  point.  The  only  difference  between  the  two  cases  is  that  the  coordinates  of  our 
"  point  at  infinity  "  are  the  limits  of  the  coordinates  of  a  true  point. 

Thus,  in  particular,  it  is  a  pure  convention,  though  a  desirable  and  convenient 
one,  when  we  say  that  two  points  at  infinity  shall  be  regarded  as  coincident  when  and 
only  when  their  coordinates  are  proportional.  We  might,  if  we  chose,  regard  all 
points  at  infinity  as  coincident.     There  is  no  logical  compulsion  in  the  matter. 
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The  equation 

AX^  +  BXY+  CY^+  DX+EY+  F=  0 

becomes,  in  homogeneous  coordinates, 

or  Ax^  +  Bxt/  +  Of  +  Bxt  +  Ei/t  +  Ft^  =  0, 

a  homogeneous  equation  of  the  second  degree ;  and  it  is  evident  that 
if  the  coordinates  X^  Y  in  any  algebraic  equation  be  replaced  by 
the  coordinates  x,  y,  t,  the  resulting  equation  will  be  homogeneous, 
and  of  the  same  degree  as  the  original  equation.  It  is  to  this  fact 
that  the  system  owes  its  name,  as  well  as  one  of  its  chief  advantages. 
The  equation  Ax  +  By+Ct=0 

represents,  in  general,  a  line,  but  if  J.  =  ^  =  0,  (7=?^  0,  it  has  no  true 
geometric  locus.  It  is,  in  this  case,  satisfied  by  the  coordinates  of  all 
points  at  infinity,  and  by  the  coordinates  of  no  other  point.  We  shall 
therefore  speak  of  it  as  the  equation  of  the  line  at  infinity.  The  reader 
may  easily  verify,  by  using  the  equation  of  a  line  in  terms  of  its  inter- 
cepts, that  if  a  straight  line  move  further  and  further  away,  its  homo- 
geneous equation  will  approach  more  and  more  nearly  the  form  ^=0. 
In  space  of  three  dimensions  we  will  represent  the  point  whose 
cartesian  coordinates  are  JT,  Y^  Z  by  the  four  homogeneous  coordi- 
nates a:,  y^  z^  t^  whose  ratios  are  defined  by  the  equations 

t  t         t 

We  will  speak  of  {x^  y,  2,  0)  as  "  the  point  at  infinity  "  on  a  line 
whose  direction  cosines  are 

^     X  '    y  z 


Va;^  -\-  y'^-\-  z^      ^^  -\-  y^-\-z^       Va;^  -\-  y^  -^z^ 

(0,  0,  0,  0)  will  be  excluded,  and  ^  =  0  will  be  spoken  of  as  the  equa- 
tion of  the  plane  at  infinity. 

Extending  the  same  terminology  to  the  general  case,  we  shall 
sometimes  find  it  convenient  to  speak  of  (a;^,  0^2,  •••  x^)  not  as  a  point 
in  space  of  n  dimensions,  but  as  a  point  represented  by  its  homo- 
geneous  coordinates   in   space   of  ^  —  1    dimensions.     Two    points 


14  INTRODUCTION  TO  HIGHER  ALGEBRA 

whose  coordinates  are  proportional  will  be  spoken  of  as  identical, 
a  point  whose  last  coordinate  is  zero  will  be  spoken  of  as  a  point  at 
infinity,  and  the  case  Xi=  •••  =a;„  =  0  will  not  be  spoken  of  as  a 
point  at  all.  This  terminology  will  be  adopted  only  in  connection 
with  homogeneous  polynomials,  and  even  then  it  must  be  clearly 
understood  that  we  are  perfectly  free  to  adopt  whichever  terminol- 
ogy we  find  most  convenient.  Thus,  for  instance,  if  f{x^,  x^^  x^  is 
a  homogeneous  polynomial  of  the  second  degree,  the  equation /=  0 
may  be  regarded  either  as  determining  a  conic  in  a  plane  {x^^  x^,  x^ 
being  homogeneous  coordinates)  or  a  quadric  cone  in  space  {x-^^  x^^  x^ 
being  ordinary  cartesian  coordinates). 

Homogeneous  coordinates  may  also  be  used  in  space  of  one 
dimension.  We  should  then  determine  the  points  on  a  line  by  two 
coordinates  x^  t  whose  ratio  x/t  is  the  non-homogeneous  coordinate 
JT,  i.e.  the  distance  of  the  point  from  the  origin.  It  is  this  repre- 
sentation that  is  commonly  made  use  of  in  connection  with  the 
theory  of  binary  forms. 

5.  The  Continuity  of  Polynomials. 

Definition.  A  function  f  {x^^  •••  x^)  is  said  to  be  continuous  at  the 
point  (^1,  •••  c„)  if  no  matter  how  small  a  positive  quantity/  e  be  chosen^ 
a  neighborhood  of  the  point  (c^,  •••  c^)  can  befourid  so  small  that  the  dif- 
ference between  the  value  of  the  function  at  any  point  of  this  neighbor- 
hood and  its  value  at  the  point  (cj,  •••  (?„)  is  in  absolute  value  less  than  e. 

That  is,  /  is  continuous  at  ((?i,  •••  c^)  if,  having  chosen  a  positive 
quantity  €,  it  is  possible  to  determine  a  positive  3  such  that 

\f{x^,'--x^)-f{c^,'-'C^)\<e 

for  all  values  of  {x^  •••  x^  which  satisfy  the  inequalities. 

Theorem  1.  //  two  functions  are  continuous  at  a  point.,  their 
sum  is  continuous  at  this  point. 

Let/i  and/2  be  two  functions  continuous  at  the  point  (6?i,  •••  c^) 
and  let  k^  and  A^g  be  their  respective  values  at  this  point.  Then,  no 
matter  how  small  the  positive  quantity  e  may  be  chosen,  we  may 
take  Sj  and  h^  so  small  that 

1/1  —  ^1 1  <  2  ^  when  \xi  —  Ci\<  Sj, 

1/2  —  -^2 1  <  2  ^  when  \xi  —  Ci\  <  h^. 
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Accordingly 

l/i  -  ^1 1  +  1/2  -  ^2 1  <  €  when  \xi  —  Ci\  <  8, 

where  8  is  the  smaller  of  the  two  quantities  8^  and  ^2 ;  ^^^-i  since 

|^|  +  |^|^|J.  +  ^|,  we  have 
1/1  -  ^1 +/2 -  h\  =  l(/i  +/2) -  (^1  +  ^2)1  <  ^     when  \xi -Ci\<8. 
Hence /j  4-/2  is  continuous  at  the  point  (c^  •••  c„). 

Corollary.  If  a  finite  number  of  functions  are  continuous  at  a- 
pointy  their  sum  is  continuous  at  this  point. 

Theorem  2.  If  two  functions  are  continuous  at  a  pointy  their 
product  is  continuous  at  this  point. 

Let/j  and/2  be  the  two  functions,  and  k^  and  k^  their  values  at 
the  point  (cj,  •  •  •  c^  where  they  are  assumed  to  be  continuous.  We 
have  to  prove  that  however  small  e  may  be,  8  can  be  chosen  so  small 
that 

(1)  1/1/2  -k^k^\<e  when  \xi- Ci\<8. 

Let  1;  be  a  positive  constant,  which  we  shall  ultimately  restrict  to  a 
certain  degree  of  smallness,  and  let  us  choose  two  positive  constants 
8^  and  82  such  that  |^^  -k,\<v  when  \x,-c,\<  8,, 

1/2  —  ^2!  <  ^  when  \Xi  —  Ci\<  gg. 

Now  take  8  as  the  smaller  of  the  two  quantities  8-^  and  82.  Then, 
when  I  Xi  —  ^^  |  <  S, 

1/1/2  -  -^1*21  =  l/2(/i  -  ^1)  +  h(A  -  h)\ 

^  1/2!  1/1 -*il +  1-5:11  l/i -■^2l^!|/2l  +  l*ll!'?- 
Accordingly  since,  when  \x.i  —  c^\<  8, 

I/2I  =  1^2  +  (/2  -  ^2)1  ^  1^2!  +  1/2  -  ^2!  <  1^2!  +  ^, 

we  may  write 

(2)  1/1/2- -^1*2!  <[|^l|  +  |^2|i '/  +  '?'• 

If  k^  and  A^g  ^r^  not  both  zero,  let  us  take  r)  small  enough  to  satisfy 
the  two  inequalities 
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If  h-^  —  k^  =  0,  we  will  restrict  rj  merely  by  the  inequality 

7]  <  Ve. 

In  either  case,  inequality  (2)  then  reduces  to  the  form  (1)  and  our 
theorem  is  proved. 

Corollary.  If  a  finite  number  of  functions  are  continuous  at  a 
pointy  their  product  is  continuous  at  this  point. 

Referring  now  to  our  definition  of  continuity,  we  see  that  any 
constant  may  be  regarded  as  a  continuous  function  of  (a;^,  •••  x^)  for 
all  values  of  these  variables,  and  that  the  same  is  true  of  any  one  of 
these  variables  themselves.  Hence  by  the  last  corollary  any  function 
of  the  form  Cx^^  •••  x\  where  the  A;'s  are  integers  positive  or  zero,  is 
continuous  at  every  point.  If  we  now  refer  to  the  corollary  to 
Theorem  1,  we  arrive  at  the  theorem: 

Theorem  3.  Any  polynomial  is  a  continuous  function  for  all 
values  of  the  variables* 

Finally,  we  give  a  simple  application  of  this  theorem. 

Theorem  4.  If  f{xy,  •••  x^  is  a  polynomial  and  fic-^^  •••  <?„)9^0, 
it  is  possible  to  take  a  neighborhood  of  the  point  (c?^,  •••  c^)  so  small  that 
f  does  not  vanish  at  any  point  in  this  neighborhood. 

Let  k=f{cy,  •••  (?„).  Then,  on  account  of  the  continuity  of/  at 
(^1^  •••  (^n)^  ^  positive  quantity  h  can  be  chosen  so  small  that  through- 
out the  neighborhood  \Xi  —  Cj|  <  8,  the  inequality 

\f-k\<l\k\ 

is  satisfied.     In  this  neighborhood  /  cannot  vanish ;  for  at  any  point 
where  it  vanished  we  should  have 

\f-k\  =  \k\<i\k\, 

which  is  impossible  since  by  hypothesis  k^O. 

6.  The  Fundamental  Theorem  of  Algebra.  Up  to  this  point  no 
use  has  been  made  of  what  is  often  known  as  the  fundamental 
theorem  of  algebra,  namely  the  proposition  that  every  algebraic 
equation  has  a  root.  This  fact  we  may  state  in  more  precise  form 
as  follows : 
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Theorem  1.  If  f(x)  is  a  polynomial  of  the  nth  degree  where 
n  ^  1,  there  exists  at  least  one  value  of  x  for  which  f(x)  =  0. 

This  theorem,  fundamental  though  it  is,  is  not  necessary  for 
most  of  the  developments  in  this  book.  Moreover,  the  methods  of 
proving  the  theorem  are  essentially  not  algebraic,  or  only  in  part 
algebraic.  Accordingly,  we  will  give  no  proof  of  the  theorem  here, 
but  merely  refer  the  reader  who  desires  a  formal  proof  to  any  of  the 
text-books  on  the  theory  of  functions  of  a  complex  variable.  We 
shall,  however,  when  we  find  it  convenient  to  do  so,  assume  the 
truth  of  this  theorem.  In  this  section  we  will  deduce  a  few  of  its 
more  immediate  consequences. 

Theorem  2.     If  f(x)  is  a  polynomial  of  the  nth  degree^ 

f{x)=aQx''  +  a^x^-^+  •••  +an_^x  +  an  (^^=7^=0), 

there  exists  one  and  only  one  set  of  constants^  «j,  a^^  •••  a„,  such  that 
f{x)=aQ{x  -  a^)(x  -  «2)  •••  (^  -  ««)• 

This  theorem'  is  seen  at  once  to  be  true  for  polynomials  of  the 
first  degree.  Let  us  then  use  the  method  of  mathematical  induction 
and  assume  the  proposition  true  for  all  polynomials  of  degree  less 
than  n.  If  we  can  infer  that  the  theorem  is  true  for  polynomials  of 
the  nth.  degree,  it  follows  that  being  true  for  those  of  the  first  degree 
it  is  true  for  those  of  the  second,  hence  for  those  of  the  third,  etc. 

By  Theorem  1   we  see  that  there  is  at  least  one  value  of  x  for 

which /(a;)  =  0.     Call  such  a  value  a^     By  Theorem  1,  §  1  we  may 

write 

f(x)  =  (x-a^)(t)(x), 

where  <j)(x)  =  aQx''~'^ -hh^x'"'^ -{-  •"  +1^-1- 

Since  <f>(x)  is  a  polynomial  of  degree  n  —  1,  and  since  we  are  assum- 
ing our  theorem  to  be  true  for  all  such  polynomials,  there  exist 
n  —  l  constants  a^,  •••  a^  such  that 

<l)(x)  =  a^{x  -a^)"'{x-  a^). 

Hence  f{x)  =  a^ix  —  a^ {x  —  a^)-"  (x  —  a„). 

Thus  half  of  our  theorem  is  proved. 
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Suppose  now  there  were  two  such  sets  of  constants,  ccj,  •••  a„  and 
/^i,  •••  /3„.     We  should  then  have 

(1)  f{x)  =  a^{x  -  «j)  ...  (a;  -  a„)  =  a^{x  -^^)...{x-^  /3^). 

Let  a;  =  «!  in  this  identity.     This  gives 

Accordingly,  since  a^^^O^  a^  must  be  equal  to  one  of  the  quan- 
tities /3j,  ^^2,  •••  /3„.  Let  us  suppose  the  /3's  to  have  been  taken  in 
such  an  order  that  a^  =  ^j.  Now  in  the  identity  (1)  cancel  out  the 
factor  «o(^~  ^i)  (^^^  Theorem  4,  §  2).     This  gives 

(x  -  «2)  •••  (^  -  a^)  =  (x-^.2) '"  (x  -  /3„). 

Accordingly,  since  we  have  assumed  the  theorem  we  are  proving 
to  be  true  for  polynomials  of  degree  n—  1,  the  constants  ^^^  •••  jS^ 
are  the  same,  except  perhaps  for  the  order,  as  the  constants  a^^  ...  a^, 
and  our  theorem  is  proved. 

Definition.  The  constants  a^,  •••  a^  determined  in  the  last  theo- 
rem are  called  the  roots  of  the  polynomial  f{x)^  or  of  the  equation 
f(x)  =  0.  If  k  of  these  roots  are  equal  to  one  another^  hut  different 
from  all  the  other  roots,  this  root  is  called  a  kfold  root. 

It  is  at  once  seen  by  reference  to  Theorem  1,  §  1  that  these  roots 
are  the  only  points  at  which /(a;)  vanishes. 

Theorem  3.  Iff{x^,  •••  x^is  a  polynomial  which  is  not  identi- 
cally equal  to  a  constant,  there  are  an  infinite  number  of  points  {x^,  ...  x^ 
at  which  f=^0,  and  also  an  infinite  number  at  which  f=  0,  provided 
n>l. 

The  truth  of  the  first  part  of  this  theorem  is  at  once  obvious,  for, 
since /is  not  identically  zero,  a  point  can  be  found  at  which  it  is  not 
zero,  and  then  a  neighborhood  of  this  point  can  be  taken  so  small 
that /does  not  vanish  in  this  neighborhood  (Theorem  4,  §  5).  This 
neighborhood,  of  course,  consists  of  an  infinite  number  of  points. 

To  prove  that  /  vanishes  at  an  infinite  number  of  points,  let  us 
select  one  of  the  variables  which  enters  into  /  to  at  least  the  first 
degree.  Without  loss  of  generality  we  may  suppose  this  variable 
to  be  Xy     We  may  then  write 

/(rrp  •••  x^)  =  Fq(x2,  ...  x^)xf -\- F^(x^,  ...  x^)xi-^  H h  Ff,(x2,  •"  x,) 
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where  k>l  and  F^  is  not  identically  zero.  Let  (cgi  •••  <?«)  be  any 
point  at  which  ^o  is  not  zero.  Then/(2:j,  c^^^  •••  c^)  is  a  polynomial 
of  the  ^th  degree  in  x-^  alone.  Accordingly,  by  Theorem  1,  there  is 
at  least  one  value  of  x^  for  which  it  vanishes.  If  c-^  is  such  a  value, 
/(cp  c^,  •••  Cn)  —  0.  Moreover,  by  the  part  of  our  theorem  already 
proved,  there  are  an  infinite  number  of  points  where  F^  4^  0,  that 
is  an  infinite  number  of  choices  possible  for  the  quantities  c^^  •••  c^. 
Thus  our  theorem  is  completely  proved. 

Finally,  we  will  state,  without  proof,  for  future  reference,  a 
theorem  which  says,  in  brief,  that  the  roots  of  an  algebraic  equation 
are  continuous  functions  of  the  coefficients : 


Theorem  4.     If  a  is  a  root  of  the  polynomial 
a^x""  +  a^x""-^  H h  a,,_^x  +  a^ 


.n-l    ,  I     ^        ^    I     ^  /ao=5tOV 


n  >  0 


then  no  matter  how  small  a  neighborhood  \x—  a|<e  of  the  point  a  we 
may  consider^  it  is  possible  to  take  in  space  of  n+1  dimensions  a  neigh- 
borhood of  the  point  (a^^a^^  •••  (h^)  so  small  that^  if  (b^tb^^  •••  J„)  is  any 
point  in  this  neighborhood,  the  polynomial 

^0^^  +  5ia:"-l  +  ...  +  5„_l2:  +  5„ 

has  at  least  one  root  /3  in  the  neighborhood  \x—  a\<e  of  the  point  a. 

For    a    proof    of    this   theorem   we   refer   to   Weber's   Algebra^ 
Vol.  1,  §  44. 

*  The  theorem  remains  true  if  we  merely  assume  that  the  polynomial  is  of  at  least 
the  first  degree.    That  is,  some  of  the  first  coefficients  ao?  «i,  •••  may  be  zero. 


CHAPTER  II 
A   FEW   PROPERTIES   OF   DETERMINANTS 

7.  Some  Definitions.  We  assume  that  the  reader  is  familiar  with 
the  determinant  notation,  and  will  merely  recall  to  him  that  by  a 
determinant  of  the  nth  order 
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we  understand  a  certain  homogeneous  polynomial  of  the  nth  degree 
in  the  n^  elements  a^j.  By  the  side  of  these  determinants  it  is  often 
desirable  to  consider  the  system  of  the  n^  elements  arranged  in  the 
order  in  which  they  stand  in  the  determinant,  but  not  combined 
into  a  polynomial.  Such  a  square  array  of  n^  elements  we  speak 
of  as  a  matrix.  In  fact,  we  will  lay  down  the  following  somewhat 
more  general  definition  of  this  term  : 

Definition  1.  A  system  of  mn  quantities  arranged  in  a  rectangu- 
lar array  of  m  rows  and  n  columns  is  called  a  matrix.  If  m  =  n,  we 
say  that  we  have  a  square  matrix  of  order  n. 

It  is  customary  to  place  double  bars  on  each  side  of  this  array, 
thus: 
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Sometimes  parentheses  are  used,  thus  : 


^21       ^22 
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2n 


a 


m2 


a. 


Even  when  a  matrix  is  square,  it  must  be  carefully  noticed  that 
it  is  not  a  determinant.  In  fact,  a  matrix  is  not  a  quantity  at  all,* 
but  a  system  of  quantities.  This  difference  between  a  square  ma- 
trix and  a  determinant  is  clearly  brought  out  if  we  consider  the 
effect  of  interchanging  columns  and  rows.  This  interchange  has  no 
effect  on  a  determinant,  but  gives  us  a  wholly  new  matrix.  In  fact, 
we  will  lay  down  the  definition: 

Definition  2.     Two  square  matrices 
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a. 


of  which  either  is  obtained  from  the  other  by  interchanging  roivs  and 
columns  are  called  conjugate  f  to  each  other. 

Although,  as  we  have  pointed  out,  square  matrices  and  deter- 
minants are  wholly  different  things,  every  determinant  determines  a 
square  matrix,  the  matrix  of  the  determinant^  and  conversely  every 
square  matrix  determines  a  determinant,  the  determinant  of  the 
matrix. 

Every  matrix  contains  other  matrices  obtained  from  it  by  strik- 
ing out  certain  rows  or  columns  or  both.  In  particular  it  contains 
certain  square  matrices  ;  and  the  determinants  of  these  square 
matrices  we  will  call  the  determinants  of  the  matrix.  If  the  matrix 
contains  m  rows  and  n  columns,  it  will  contain  determinants  of  all 
orders  from  1  (the  elements  themselves)  to  the  smaller  of  the  two 
integers  and  m  and  n  inclusive.  J     In  many  important  problems  all 

*  Cf.,  however,  §  21.  t  Sometimes  also  transposed. 

t  If  m  =  w,  there  is  only  one  of  these  determinants  of  highest  order,  and  it  was  this 
which  we  called  above  the  determinant  of  the  square  matrix. 
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of  these  determinants  above  a  certain  order  are  zero,  and  it  is  often 
of  great  importance  to  specify  the  order  of  the  highest  non- vanish- 
ing determinant  of  a  given  matrix.  For  this  purpose  we  lay  down 
the  following  definition  : 

Definition  3.  A  matrix  is  said  to  he  of  rank  r  if  it  contains  at 
least  one  r-rowed  determinant  which  is  not  zero^  while  all  determinants 
of  order  higher  than  r  which  the  matrix  may  contain  are  zero, 

A  matrix  is  said  to  be  of  rank  0  if  all  its  elernents  are  zero. 

For  brevity,  we  shall  speak  also  of  the  rank  of  a  determinant, 
meaning  thereby  the  rank  of  the  matrix  of  the  determinant. 

We  turn  now  to  certain  definitions  concerning  the  minors  of 
determinants  ;  that  is,  the  determinants  obtained  from  the  given 
determinant  by  striking  out  certain  rows  and  columns. 

It  is  a  familiar  fact  that  to  every  element  of  a  determinant 
corresponds  a  certain  first  minor ;  namely,  the  one  obtained  by 
striking  out  the  row  and  column  of  the  determinant  in  which 
the  given  element  lies.  Now  the  elements  of  a  determinant 
of  the  nth.  order  may  be  regarded  as  its  {n  —  l)th  minors. 
Accordingly  we  have  here  a  method  of  pairing  off  each  one- 
rowed  minor  of  a  given  determinant  with  one  of  its  {n  —  l)-rowed 
minors. 

Similarly,  if  iHf  is  a  two-rowed  minor  of  a  determinant  of  the 
nth  order  i),  we  may  pair  it  off  against  the  (n  —  2)-rowed  minor  W 
obtained  by  striking  out  from  D  the  two  rows  and  columns  which 
are  represented  in  M.  The  two  minors  M  and  iV  we  will  speak  of 
as  complementary.     Thus,  in  the  determinant 
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In  the  same  way  we  pair  off  with  every  three-rowed  minor  an 
(n—  3)-rowed  minor;  etc.     In  general  we  lay  down 

Definition  4.  If  D  is  a  determinant  of  the  nth  order  and  M 
one  of  its  h-rowed  minors^  then  the  {n  —  kyrowed  minor  N  obtained  by 
striking  out  from  D  all  the  rows  and  columns  represented  in  M  is  called 
the  complement  of  M. 

Conversely,  itf  is  clearly  the  complement  of  N, 

Let  us  go  back  now  for  a  moment  to  the  case  of  the  one-rowed 
minors ;  that  is  to  the  elements  themselves.  Let  a^  be  the  element  of 
the  determinant  I)  which  stands  in  the  ^th  row  and  the  /th  column. 
Let  Dij  represent  the  corresponding  first  minor.  It  will  be  recalled 
that  we  frequently  have  occasion  to  consider  not  this  minor  By  but 
the  cof actor  Ay  of  a^-  defined  by  the  equation     -4y  =  (—  Vf'^^Dy. 

Similarly,  it  is  often  convenient  to  consider  not  the  complement 
of  a  given  minor  but  its  algebraic  compleme7it^  which  in  the  case  just 
mentioned  reduces  to  the  cofactor,  and  which,  in  general,  we  define 
as  follows : 

Definition  5.  If  M  is  the  m-rowed  minor  of  D  in  which  the  rows 
hi^  '"  k^  and  the  columns  Zj,  •  •  •  Z^„  are  represented,  then  the  algebraic  com- 
plement of  M  is  defined  by  the  equation 

alg.  compl.  of  i[f=(-l/^+-+'^'^+^i+-  +  ^'"[compl.  oi  M~\, 

The  following  special  case  is  important : 

Definition  6.  By  a  principal  minor  of  a  determinant  B  is  under- 
stood a  minor  obtained  by  striking  out  from  B  the  same  rows  as  columns. 

Since  in  this  case,  using  the  notation  of  Definition  5,  we  have 

^1  "f"  •  •  •  "f"  ^m  ^^    ll~^  '"  ~^  li 


^mi 


it  follows  that  the  algebraic  complement  of  any  principal  minor  is  equal 
to  its  plain  complement. 

We  have  so  far  assumed  tacitly  that  the  orders  of  the  minors  we 
were  dealing  with  were  less  than  the  order  n  of  the  determinant 
itself.  By  the  ?^-rowed  minor  of  a  determinant  B  of  the  nth  order 
we  of  course  understand  this  determinant  itself.  The  complement 
of  this  minor  has,  however,  by  our  previous  definition  no  meaning. 
We  will  define  the  complement  in  this  case  to  be  1,  and,  by  Definition  5, 
this  will  also  be  the  algebraic  complement. 
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EXERCISE 

Prove  that,  if  M  and  N  are  complementary  minors,  either  M  and  N  are  the 
algebraic  complements  of  each  other,  or  —  iV  is  the  algebraic  complement  of 
M  and  —  M  is  the  algebraic  complement  of  N. 

8.  Laplace's  Development.  Just  as  the  elements  of  any  row  or 
column  and  their  corresponding  cofactors  may  be  used  to  develop  a 
determinant  in  terms  of  determinants  of  lower  orders,  so  the  y5;-rowed 
minors  formed  from  any  k  rows  or  columns  may  be  used,  along  with 
their  algebraic  complements,  to  obtain  a  more  general  development 
of  the  determinant,  due  to  Laplace,  and  which  includes  as  a  special 
case  the  one  just  referred  to.  In  order  to  establish  this  develop- 
ment, we  begin  with  the  following  preliminary  theorem : 

Theorem  1.  If  the  roivs  and  columns  of  a  determinant  D  he 
shifted  in  such  a  way  as  to  bring  a  certain  minor  M  into  the  upper  left- 
hand  corner  without  changing  the  order  of  the  rows  and  columns  either 
of  M  or  of  its  complement  iVi  then  this  shifting  will  change  the  sign  of 
D  or  leave  it  unchanged  according  as  —  N  or  N  is  the  algebraic  com- 
plement of  M. 

To  prove  this  let  us,  as  usual,  number  the  rows  and  columns  of 
i>,  beginning  at  the  upper  left-hand  corner,  and  let  the  numbers  of 
the  rows  and  columns  represented  in  iff,  arranged  in  order  of  increas- 
ing magnitude,  be  k^^-'-k^^  and  Z^,  •••Z^  respectively.  In  order  to 
effect  the  rearrangement  mentioned  in  the  theorem,  we  may  first 
shift  the  row  numbered  k^  upward  into  the  first  position,  thus  carry- 
ing it  over  k^  —  1  other  rows  and  therefore  changing  the  sign  of  the 
determinant  k-^  —  1  times.  Then  shift  the  row  numbered  k^^  into  the 
second  position.  This  carries  it  over  k^  —  2  rows  and  hence  changes 
the  sign  kc^  —  l  times.  Proceed  in  this  way  until  the  row  numbered 
kjn  has  been  shifted  into  the  mth  position.  Then  shift  the  columns 
in  a  similar  manner.     The  final  result  is  to  multiply  D  by 


/_  ;J^Ui+-.+  *m+/i+-+?7n-2(l+2+ •••+?«)  =  (  _  1^1+-  +km-¥h+ - 


■¥Jm 


Comparing  this  with  Definition  5,  §  7,  the  truth  of  our  theorem  is 
obvious. 

Lemma.  If  M  is  a  minor  of  a  determinant  i>,  the  product  of  M 
by  Vs  algebraic  complement  is  identical^  when  expanded^  with  some  of 
the  terms  of  the  expansion  of  D. 
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Let  i>  = 


^11        *"        ^In 


^nl        '"        ^r 


and  call  the  order  of  M^  m,  and  its  complement  N.  We  will  first 
prove  our  lemma  in  the  special  case  in  which  Jf  stands  in  the  tipper 
left-hand  corner  of  i>,  so  that  iV,  which  in  this  case  is  the  algebraic 
complement,  is  in  the  lower  right-hand  corner.  What  we  have  to 
show  here  is  that  the  product  of  any  term  of  Mhj  Si  term  of  iV  is  a 
term  of  i>,  and  that  this  term  does  not  come  in  twice  to  the  product 
MJV.     Any  term  of  M  may  be  written  * 

where  the  integers  Zj,  l^'^'-'lm  are  merely  some  arrangement  of  the 
integers  1,  2,  •••  m,  and  ^i  is  the  number  of  inversions  of  order  in  this 
arrangement.     Similarly,  any  term  of  N  may  be  written 

where  lm+i->  "•  ^n  is  merely  some  arrangement  of  the  integers  m-fl, 
•••71,  and  V  is  the  number  of  inversions  of  order  in  this  arrangement. 
The  product  of  these  two  terms 

is  a  term  of  2),  for  the  factors  a  are  chosen  in  succession  from  the 
first,  second,  •••  nth.  rows  of  D,  and  no  two  are  from  the  same  col- 
umn, and  /i  +  z^  is  clearly  precisely  the  number  of  inversions  of  order 
in  the  arrangement  Z^,  l^^  •  •  •  Z„,  as  compared  to  the  natural  arrange- 
ment, 1,  2,  •••  n,  of  these  integers. 

Having  thus  proved  our  lemma  in  the  special  case  in  which  M 
lies  in  the  upper  left-hand  corner  of  D,  we  now  pass  to  the  general 
case.  Here  we  "may,  by  shifting  rows  and  columns,  bring  iHf  into 
the  upper  left-hand  corner  and  N  into  the  lower  right-hand  corner. 
JThis  has,  by  Theorem  1,  the  effect  of  leaving  each  term  in  the 
expansion  of  D  unchanged,  or  of  reversing  the  sign  of  all  of  them 
according  as  iVor  —  iVis  the  algebraic  complement  of  M.  Accord- 
ingly, since  the  product  MN  gives,  as  we  have  just  seen,  terms  in 
the  expansion  of  this  rearranged  determinant,  the  product  of  M  by 
its  algebraic  complement  gives  terms  in  the  expansion  of  D  itself, 
as  was  to  be  proved. 


26 


INTRODUCTION   TO   HIGHER  ALGEBRA 


Laplace's  Development,  which  may  be  stated  in  the  form  of  the 
following  rule,  now  follows  at  once : 

Theorem  2.  Pick  out  any  m  rows  (or  columns')  from  a  determi- 
nant 7),  and  form  all  the  m-rowed  determinants  from  this  matrix.  The 
sum  of  the  products  of  each  of  these  minors  by  its  algebraic  complement 
is  the  value  of  D. 

Since,  by  our  lemma,  each  of  these  products  when  developed  con- 
sists of  terms  of  D,  it  remains  merely  to  show  that  every  term  of 
D  occurs  in  one  and  only  one  of  these  products.  This  is  obviously 
the  case ;  for  every  term  of  D  contains  one  element  from  each  of  the 
m  rows  of  D  from  which  our  theorem  directs  us  to  pick  out  m-rowed 
determinants,  and,  since  these  elements  all  lie  in  different  columns, 
they  lie  in  one  and  only  one  of  these  w-rowed  determinants,  say  M. 
Since  the  other  elements  in  this  term  of  D  obviously  all  lie  in  the 
complement  N  of  iHf,  this  term  will  be  found  in  the  product  MJ^  and 
in  none  of  the  other  products  mentioned  in  our  theorem. 


EXERCISES 

1.  From  a  square  matrix  of  order  n  and  rank  r,  s  rows  (or  columns)  are  selected. 
Prove  that  the  rank  of  the  matrix  thus  obtained  cannot  be  less  than  r  -{■  s  —  ii. 

2.  Generalize  the  theorem  of  Exercise  1. 

9.  The  Multiplication  Theorem.  Laplace's  Development  enables 
us  to  write  out  at  once  the  product  of  any  two  determinants  as  a 
single  determinant  whose  ordor  is  the  sum  of  the  orders  of  the  two 
given  determinants 


^11   ***   ^In 


^rel   ***   ^nn 


^11     •••   ^im 


Ki  •••  ^ 


whatever  the  values  of  the  p's  may  be.  For,  expanding  the  large 
determinant  in  terms  of  the  9i-rowed  minors  of  the  first  n  rows,  all 
the  terms  of  the  expansion  are  zero  except  the  one  written  in  the 
first  member  of  the  equation. 
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From  this  formula  we  will  now  deduce  a  far  more  important 
one  for  expressing  the  product  of  two  determinants  of  the  same 
order  as  a  determinant  of  that  order.  For  this  purpose  let  us  choose 
the  p's  in  the  last  formula  as  follows : 

p. J  =  0      when  i  =jLJ,     p..  =  -  1, 

and  let  us  consider  for  simplicity  the  product  of  two  determinants 
of  the  third  order.     We  have 
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Let  us  now  reduce  this  six-rowed  determinant  by  multiplying 
its  first  column  by  a^  and  adding  it  to  the  fourth  column;  then 
multiply  the  first  column  by  a^  and  add  it  to  the  fifth;  then 
multiply  the  first  column  by  a^  and  add  it  to  the  sixth.  In  this 
way  we  bring  zeros  into  the  last  three  places  in  the  fourth  row. 
Next  multiply  the  second  column  successively  by  5^,  h^,  b^  and 
add  it  to  the  fourth,  fifth,  and  sixth  columns  respectively. 
Finally  multiply  the  third  column  successively  by  c^^  Cg,  Cg  and 
add  it  to  the  fourth,  fifth,  and  sixth  columns.  The  determinant 
thus  takes  the  form 
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-1       0 
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0       0 


ttg      Ctj^j  4-^2^1  "^^3^1  ^1  ^2  "f~  ^2^2  "^' ^3  ^^2  ^1  ^3  +  ^2  ^3  "^  ^3  ^3 

/^s    ^l«l  +  /32^i  +  /3gCi  ^ia2  +  '^2^2  +  ^3^2  ^l«3  +  /52^3  +  ^3^3 

73     7l<^l  +  72^  +  73<^l  7l<^2  +  72^2  +  73^2  71^3  +  72^3  +  73^3 

0  0  0  0 

0  0  0  0 

-10  0  0 


and  this  reduces  at  once  to  the  three-rowed  determinant 

a^  a^  +  cfg  5 J  -f  «3  c^       a^  a^  +  ^^2  ^2  +  ^z  ^2       ^1  ^3  +  ^2  ^3  +  "3  ^3 
/^i  cti  +  /32^i  4-  /3g  c^     /3i  ^2  4-  ^2  ^2  +  ^3  ^2     ^1  ^3  +  /52  ^3  +  ^3^3 

7l«l  +  72^1  +  73^1        7l«2  +  72^2  +  73^2         7l«3  +  72'^3  +  73^3 
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We  have  thus  expressed  the  product  of  two  determinants  of  the 
third  order  as  a  single  determinant  of  the  third  order.  The  method 
we  have  used  is  readily  seen  to  be  entirely  general,  and  we  thus  get 
the  following  rule  for  multiplying  together  two  determinants  of  the 
nth.  order  : 

Theorem.  The  product  of  two  determinants  of  the  nth  order 
may  he  expressed  as  a  determinant  of  the  nth  order  in  which  the 
element  which  lies  in  the  ith  row  and  jth  column  is  obtained  by 
multiplying  each  element  of  the  ith  row  of  the  first  factor  by  the 
corresponding  element  of  the  jth  column  of  the  second  factor  and 
adding  the  results. 

It  should  be  noted  that  changing  rows  into  columns  in  either  or 
both  of  the  given  determinants,  while  not  affecting  the  value  of  the 
product,  will  alter  its  form  materially.     For  example, 
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and  similarly  the  product  of  any  two  determinants  of  the  same  order 
may  be  written  in  four  different  forms. 

10-  Bordered  Determinants.  If  to  a  determinant  of  the  nth. 
order  we  add  one  or  more  rows  and  the  same  number  of  columns  of 
n  quantities  each  and  fill  in  the  vacant  corner  with  zeros,  the  result- 
ing determinant  is  called  a  bordered  determinant.  Thus  starting  from 
the  two-rowed  determinant 

a     /3 

y    ^ 
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we  may  form  the  bordered  determinants 
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If  in  the  second  of  these  examples  we  use  Laplace's  Development 
to  expand  the  bordered  determinant  according  to  the  two-rowed 
determinants  of  the  last  two  rows,  we  see  that  its  value  is 
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a  quantity  into  which  the  elements  a,  ^8,  7,  B  of  the  original  deter- 
minant do  not  enter.  Similarly  expanding  the  third  of  the  above 
bordered  determinants  according  to  the  three-rowed  determinants  of 
its  last  three  rows,  we  see  that  its  value  is  zero. 

The  reasoning  we  have  here  used  is  of  general  application  and 
leads  to  the  following  results : 

Theorem  1.  If  a  determinant  of  the  nth  order  is  bordered  with 
n  rows  and  n  columns^  the  resulting  determinant  has  a  value  which 
defends  only  on  the  bordering  quantities. 

Theorem  2.  If  a  determinant  of  the  nth  order  is  bordered  with 
more  than  n  rows  and  columns^  the  resulting  determinant  always  has  the 
value  zero. 

The  cases  of  interest  are  therefore  those  in  which  the  deter- 
minant is  bordered  with  less  than  n  rows  and  columns.  Concerning 
these  we  will  establish  the  following  fact : 

Theorem  3.  If  a  determinant  of  the  nth  order  be  bordered  by  p 
rows  and  p  columns  {p  <  n)  of  independent  variables^  the  resulting 
determinant  is  a  polynomial  of  degree  2p  in  the  bordering  quantities^ 
whose  coefficients  are  the  pth  minors  of  the  original  determinant ;  and 
conversely^  every  pth  minor  of  the  original  determinant  is  the  coefficient 
of  at  least  one  term  of  this  polynomial. 
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Let  us  consider  the  special  case  where  n  =  4:  and  j9  =  2. 
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4 

< 

0 

0 

Developing  this  determinant,  by  Laplace's  method  (§  8),  in  terms 
of  the  two-rowed  determinants  of  the  last  two  rows,  we  have 


2>  = 


a 


13 


a 


23 


a 


33 


«14 

^24 


34 


a 


43 


a 


44 


u. 


u. 


Uc 


u, 


H to  6  terms. 


If  now  we  expand  each  of  these  four-rowed  determinants,  by 
Laplace's  method,  in  terms  of  the  two-rowed  determinants  of  their 
last  two  columns,  and  then  arrange  the  result  as  a  polynomial  in 
the  u'^  and  v's,  the  truth  of  the  theorem  is  apparent.  We  leave  it 
to  the  reader  to  fill  in  the  details  of  the  proof  here  sketched. 

11.  Adjoint  Determinants  and  their  Minors. 
Definition.     Jf,  in  the  determinant 


D  = 


«11   • 

••    «1« 

•               • 

•            • 

•           • 

•             • 

««1    • 

^nn 

Aij  is  the  cof  actor  of  the  element  a^j,  then  the  determinant 


D'  = 


is  called  the  adjoint  of  D. 


^11        ***        ^In 


^nl       '"       ^1 
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By  corresponding  minors  of  D  and  I>\  or  indeed  of  any  two 
determinants  of  the  same  order,  we  shall  naturally  understand 
minors  obtained  by  striking  out  the  same  rows  and  columns  from 
D  as  from  B',  These  definitions  being  premised,  the  fundamental 
theorem  here  is  the  following : 

Theorem.  If  D'  is  the  adjoint  of  any  determinajit  7),  and  M  and 
M'  are  corresponding  m-rowed  minors  of  D  and  D'  respectively^  then 
M'  is  equal  to  the  product  of  D^~^  by  the  algebraic  complement  of  M. 

We  will  prove  this  theorem  first  for  the  special  case  in  which 
the  minors  iff  and  M'  lie  at  the  upper  left-hand  corners  of  D  and  D' 
respectively.     We  may  then  write 


M' 


An     • 

^\m 

... 

■^In 

^m\ 

A 

... 

A 

0 

..     0 

1 

0    . 

..     0 

0 

..     Q 

0 

1    . 

..     0 

0 

..     0 

0 

• 

0    . 

•            •            • 

.   1 

Let  us  now  interchange  the  columns  and  rows  of  D, 


D  = 


a 


11 


a 


m 


a 


In       * '  *       ^nn 


a. 


and  then  form  the  product  M' D  by  the  theorem  of  §  9. 
DO  ...     0  0  ...     0 


This  gives 


M'D  = 


0 


0 


D 


0 


<^l,m+l       ^2,  wi+1 


^\n  ^in 


0 


I) 


0 


0 


^m,  7ri+l       ^TO+1,  m+1 


^mn  ^m+1, » 


0 


..     0 


a 


n,m+l 


a. 


=  D' 


^m+l,m+l        *•*       ^n,m+l 


a 


m+l,n 


a. 
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Let  us  here  regard  a^^  •••  a^n  as  n^  independent  variables. 
Then  the  equation  just  written  becomes  an  identity,  from  which  i>, 
since  it  is  not  identically  zero,  may  be  cancelled  out,  and  we  get 


(1) 


7  _  ri?rt-i 


M'=D 


^m-\-l,m+l       '"       ^n,m+l 


a 


m+1,71 


a 


«,  n 


Since  the  determinant  which  is  written  out  in  (1)  is  precisely  the 
algebraic  complement  of  M,  our  theorem  is  proved  in  the  special 
case  we  have  been  considering.  It  should  be  noticed  that  this  proof 
holds  even  in  the  case  m  =  n;  cf.  Corollary  2  below. 

Turning  now  to  the  case  in  which  the  minors  iKf  and  M'  do  not 
lie  at  the  upper  left-hand  corners  of  D  and  D\  let  us  denote  by  a  the 
sum  of  the  numbers  which  specify  the  location  of  the  rows  and 
columns  in  M  or  M\  the  numbering  running,  as  usual,  from  the  up- 
per left-hand  corner.     Then  by  Definition  5,  §  7, 


(2) 


alg.  compl.  of  Jf=(—  1)^  [compl.  of  if]. 


Let  us  now,  by  shifting  rows  and  columns,  bring  the  determinant 
M  into  the  upper  left-hand  corner  of  JD.  Calling  the  determinant 
D,  as  thus  rearranged,  i)p    we  have  (cf.  Theorem  1,  §  8) 


(3) 


D^={-iyD. 


The  cof actors  in  D^  are  equal  to  (  —  l/^y,  since  the  interchange  of 
two  adjacent  rows  or  columns  of  a  determinant  changes  the  sign  of 
every  one  of  its  cofactors.  Accordingly  the  adjoint  of  i>j,  which  we 
will  call  i>p  may  be  obtained  from  D'  by  rearranging  its  rows  and 
columns  in  the  same  way  as  the  rows  and  columns  of  D  were 
rearranged  to  give  i)^,  and  then  prefixing  the  factor  (—  1)"*  to  each 
element. 

Let  us  now  apply  the  special  case  already  established  of  our 
theorem  to  the  determinant  D^  and  its  adjoint  D^,  the  m-rowed 
minors  M-^  and  M\  being  those  which  are  situated  in  the  upper  left- 
hand  corner  of  D^  and  D\  respectively.     We  thus  get 


(4) 


M[=I>J"-^  [alg.  compl.  of  JfJ. 
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Now,  since  M^  is  a  principal  minor,  its  algebraic  complement  is 
the  same  as  its  ordinary  complement,  and  this  in  turn  is  the  same  as 
the  ordinary  complement  of  the  minor  Min  D.  Accordingly,  using 
(2),  we  may  write 

(5)  alg.  compl.  of  M^  =  (-!)''  [alg.  compl.  of  M\ 

Since  the  elements  of  M\  differ  from  those  of  M^  only  in  having 
the  factor  (—  1)'*  prefixed  to  each,  it  follows  that 

(6)  ii[f;  =  (-i)^«if' . 

We  may  now  reduce  (4)  by  means  of  (3),  (5),  and  (6).  We  thus 
^^^       (_  \y<^M^  =  (_  iy(m-i)j)m-i(^_iy  [-^ig^  compl.  of  if]. 

Cancelling  out  the  factor  (— l)"""^  from  both  sides  of  this  equation, 
we  see  that  our  theorem  is  proved. 

We  proceed  now  to  point  out  a  number  of  special  cases  of  this 
theorem  which  are  worth  noting  on  account  of  their  frequent  occur- 
rence. 

CoROLLAKY  1.  If  a^  IS  any  element  of  a  determinant  D  of  the 
nth  order^  and  if  a^  is  the  eof actor  of  the  corresponding  element  A^j  in 
the  adjoint  of  J9,  then 


a,v  =  I)  "a 


This  is  merely  the  special  case  of  our  general  theorem  in  which 
m  =  9^  — 1,  modified,  however,  slightly  in  statement  by  the  use  of 
the  cofactor  aij  in  place  of  the  {n  —  l)-rowed  minor  (—  Vf^^ a^j. 

Corollary  2.  IfD  is  any  determinant  of  the  nth  order  and  W 
its  adjoint^  then  ^,  __  nn-i 

This  is  the  special  case  m  =  n. 

Corollary  3.  If  D  is  any  determinant^  and  S  is  the  second 
minor  obtained  from  it  by  striking  out  its  ith  and  kth  rows  and  its  jth 
and  Ith  columns^  and  if  we  denote  by  A^  the  cofactor  of  the  element 
which  stands  in  the  ith  row  and  the  jth  column  of  D,  then 

Aij     A^i 


Aicj     Aj^i 


This  is  the  special  case  m  —  2. 


D 


CHAPTER  III 
THE  THEORY  OF   LINEAR  DEPENDENCE 

12.  Definitions  and  Preliminary  Theorems.  Two  sets  of  con- 
stants (^j,  6j,  Cj,  cZj,)  and  (^21  ^2^  ^2'  ^2)  ^^^  usually  said  to  be  propor- 
tional to  one  another  if  every  element  of  one  set  may  be  obtained 
from  the  corresponding  element  of  the  other  by  multiplying  by  the 
same  constant  factor.  For  example,  (1,  2,  3,  4)  and  (2,  4,  6,  8)  are 
proportional.  It  is  ordinarily  assumed  that  either  set  may  be  thus 
obtained  from  the  other,  and  in  most  cases  this  is  true ;  but  in  the 
case  of  the  two  sets  (1,  2,  3,  4)  and  (0,  0,  0,  0)  we  can  pass  from 
the  first  to  the  second  by  multiplying  by  0,  but  we  cannot  pass  from 
the  second  to  the  first. 

A  more  convenient  definition,  for  many  purposes,  and  one  which 
is  easily  seen  to  be  equivalent  to  the  above-mentioned  one,  is  the 
following  : 

Definition  1.     The  two  sets  of  constants 

are  said  to  he  proportional  to  each  other  if  two  constants  c-^  and  c^^  not 
both  zero^  exist  such  that 

c^x'i  +  c^x^l  =  0  (z  =  1,  2,  ...  n). 

If  Cj  T^  0,  we  have 

/*.'—_  ^v   y  —  _  ^v^   ...  y  —  _  ^^'' 

1 —  -^\i  -^1 —  ^  **'2'  n —  "^^nt 

and  if  ^2  4^  0,  we  have 

^11—    _  £lV    ^u  _  __^^}    ...V'—  _>£lr' 

The  two  sets  of  constants      ^f  ^f    ...  ^1 

0,  0,  ...   0, 
34 


THE   THEORY  OF   LINEAR  DEPENDENCE  35 

are  evidently  proportional,  since  if  we  take  c^  =  0  and  c^  =  any  con- 
stant not  zero,  we  have  a  pair  of  c's  which  fulfill  the  requirements 
of  our  definition. 

Linear  dependence  may  be  regarded  as  a  generalization  of  the 
conception  of  proportionality.  Instead  of  two  sets  of  constants  we 
now  consider  m  sets,  and  give  the  following  : 

Definition  2.     The  m  sets  of  n  constants  each^ 

x^\xf,  ...  2:W  (4=1,  2,  ...  m), 

are  said  to  he  linearly  dependent  if  m  constants  c^^  c^^  •••  c?^,  not  all  zero^ 
exist  such  that 

^i^i  +  ^2^y  +  •  •  •  +  ^m^f^  =  ^  U  =  1,2,...  n). 

If  this  is  not  the  ease^  the  sets  of  quantities  are  said  to  be  linearly 
independent. 

In  the  same  way  we  generalize  the  familiar  conception  of  the 
proportionality  of  two  polynomials  as  follows : 

Definition  3.  The  m  polynomials  (in  any  number  of  independent 
variables^  /j,  /g,  •  •  •  f^  are  said  to  he  linearly  dependent  if  m  constants 
(?j,  ^2,  .  •  •  c^,  not  all  zero.,  exist  such  that 

^l/l  +  ^2/2  +  •••  +  <^mfm  =  0. 

If  this  is  not  the  case^  the  polynomials  are  said  to  be  linearly  inde- 
pendent.^ 

The  following  theorems  about  linear  dependence,  while  almost 
self-evident,  are  of  sufficient  importance  to  deserve  explicit  state- 
ment : 

Theorem  1.  If  m  sets  of  constants  (or  if  m  polynomials^  are  lin- 
early dependent^it  is  always  possible  to  express  one  —  hut  not  necessarily 
any  one  —  of  them  linearly  in  terms  of  the  others.  This  set  of  constants 
{or  this  polynomial)  is  then  said  to  be  linearly  dependent  on  the  others. 

This  is  seen  at  once  if  we  remember  that  at  least  one  of  the  c's 
is  not  zero.  The  relations  (or  relation)  in  which  the  c's  occur  can, 
then,  be  divided  through  by  this  c. 

*  We  might  clearly  go  farther  and  consider  the  linear  dependence  of  m  sets  of  ?i 
polynomials  each.  The  two  cases  of  the  text  would  be  merely  special  cases  from  this 
general  point  of  view. 
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Theorem  2.  If  there  exist  among  the  sets  of  constants  {or  among  the 
polynomials)  a  smaller  number  of  sets  (or  of  polynomials)  which  are  linearly 
dependent^  then  the  m  sets  {or  the  m polynomials)  are  linearly  dependent. 

For  suppose  there  are  I  sets  of  constants  (or  I  polynomials) 
which  are  linearly  dependent  {I  <  m),  then  we  may  take  for  our  set 
of  m  c's,  the  I  c's  which  must  exist  for  the  I  sets  (or  polynomials)  and 
(m  —  I)  zeros 

Theorem  3.  If  any  one  of  the  m.  sets  of  constants  consists  exclu- 
sively of  zeros  {or  if  any  one  of  the  polynomials  is  identically  zero),  the 
m  sets  {or  the  m  polynomials)  are  linearly  dependent. 

For  we  may  take  for  the  c  corresponding  to  this  particular  set 
(or  polynomial)  any  constant  whatever,  except  zero,  and  for  the  other 
{m  —  1)  c's,  {m  —  1)  zeros. 

13.  The  Condition  for  Linear  Dependence  of  Sets  of  Constants. 
In  considering  m  sets  of  n  constants  each, 

^[i]    /rW    ...  rf^T 
**^  1 1  "^  2  '  **'n 


(1) 


(^  =  l,  2,  •••  m), 


it  will  be  convenient  to  distinguish  between  the  two  cases  m^n  and 

{a)  m<n.  We  wish  here  to  prove  the  following  fundamental 
theorem  : 

Theorem  1.  A  necessary  and  sufficient  condition  for  the  linear 
dependence  of  the  m  sets  {!)  of  n  coyistants  each.,  when  m'^n.,  is  that  all 
the  m-rowed  determinants  of  the  matrix 


should  vanish. 


x[ 
x'l 


^[m] 


U/2 


X\ 


[m] 
2 


X, 


[m] 


That  this  is  a  necessary  condition  is  at  once  obvious;  for  if  the 
m  sets  of  constants  are  linearly  dependent,  one  of  the  rows  can  be 
expressed  as  a  linear  combination  of  the  others.  Accordingly  if  in 
any  of  the  m-rowed  determinants  we  subtract  from  the  elements  of 
this  row  the  corresponding  elements  of  the  other  rows  after  each  row 


THE  THEORY  OF  LINEAR  DEPENDENCE  37 

has  been  multiplied  by  a  suitable  constant,  the  elements  of  this  row 
will  reduce  to  zero.     The  determinant  therefore  vanishes. 

We  come  now  to  the  proof  that  the  vanishing  of  these  deter- 
minants is  also  a  sufficient  condition.  We  assume,  therefore,  that 
all  the  m-rowed  determinants  of  the  above  matrix  vanish.  Let  us 
also  assume  that  the  rank  of  the  matrix  is  r>0*  (cf.  Definition  3, 
§  7).  Without  any  real  loss  of  generality  we  may  (and  will)  assume 
that  the  r-rowed  determinant  which  stands  in  the  upper  left-hand  corner 
of  the  matrix  does  not  vanish  ;  for  by  changing  the  order  of  the  sets 
of  constants  and  the  order  of  the  constants  in  each  set  (and  these 
orders  are  clearly  quite  immaterial)  we  can  bring  one  of  the  non- 
vanishing  r-rowed  determinants  into  this  position. 

We  will  now  prove  that  the  first  {r  4- 1)  sets  of  constants  are  lin- 
early dependent.  From  this  the  linear  dependence  of  the  m  sets 
follows  by  Theorem  2,  §  12. 

Let  us  denote  by  Cj,  c^^  •••  (?^+j  the  cof actors  in  the  (rH-l)-rowed 
determinant  which  stands  in  the  upper  left-hand  corner  of  the  matrix, 
and  which  corres-pond  to  the  elements  of  its  last  column.  If  we  remem- 
ber that  all  the  (r  +  l)-rowed  determinants  vanish,  we  get  the  relations 

c^x]  -f  c^x^l  +  •••  +  ^r+i^""'^  =  0  (y  =r  -f  1,  r  -f  2,  ...  7i). 

Since  the  sum  of  the  products  of  the  elements  of  any  column  of  a 
determinant  by  the  cofactors  of  the  corresponding  elements  of  another 
column  is  zero,  this  equation  is  also  true  when^  =  1,  2,  .••  r. 

This  establishes  the  linear  dependence  of  the  first  (r  -f- 1)  sets  of 
constants,  since  <?^  +  i,  being  the  r-rowed  determinant  which  stands 
in  the  upper  left-hand  corner  of  the  matrix,  is  not  zero. 

(h)  m^n.  This  case  can  be  reduced  to  the  one  already  considered 
by  the  following  simple  device.  Add  to  each  set  of  n  constants  m—n 
zeros.  We  then  have  m  sets  of  m  constants  each.  Their  matrix  con- 
tains only  one  "?72-rowed  determinant,  and  this  vanishes  since  one,  at 
least,  of  its  columns  is  composed  of  zeros.  Therefore  these  m  sets  of  m 
constants  each  are  linearly  dependent ;  and  hence  the  original  m  sets 
of  n  constants  each  were  linearly  dependent.  Thus  we  get  the  theorem  : 

Theorem  2.  m  sets  of  n  constants  each  are  always  linearly  depen- 
dent if  m^n, 

*  In  general  we  shall  have  r  =  m  —  1 ,  but  r  may  have  any  value  less  than  w.  The 
only  case  which  we  here  exclude  is  that  in  which  all  the  elements  of  the  matrix  are 
zero,  a  case  in  which  the  linear  dependence  is  at  once  obvious. 
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EXERCISES 

Determine  whether  the  following  sets  of  constants  are  linearly 


dependent  or  not  : 


1. 


2. 


3  a, 

~2b, 

— 

3  c, 

6  J, 

a, 

0, 

— 

c, 

4rf, 

0, 

-b, 

0, 

3rf. 

1, 

0,     0, 

5, 

1, 

2,     6, 

7, 

3, 

1,     3, 

16. 

5, 

2,     1, 

3, 

4, 

0, 

3,     0, 

0, 

8, 

15, 

7,     3, 

9, 

7. 

5, 

-7, 

0, 

1, 

-1, 

1, 

-3, 

-2, 

3, 

-1, 

4, 

0, 

7, 

— 

9, 

2. 

14.   The  Linear  Dependence  of  Polynomials.     Suppose  we  have  m 
polynomials,  y^^y^,  ...y^_ 


m? 


in  any  number  of  independent  variables.  A  necessary  and  sufficient 
condition  for  the  linear  dependence  of  these  polynomials  is  evidently 
the  linear  dependence  of  their  m  sets  of  coefficients.  Thus  the  condi- 
tions deduced  in  the  last  section  can  be  applied  at  once  to  the  case  of 
polynomials. 

EXERCISES 

Determine  whether  the  following  polynomials  are  linearly  depend- 
ent  or  not: 


1. 


2. 


16a:  -f  30^, 

6ar  +  2y+   .Sz-    4, 
15x  +  9y  -18. 

3  x^  +  4  :r2  —  4  3^3  +  6  a:^, 

I  X-,  -f-  O  Xg  -|-   /  x^, 

.  —  5  Xj  4-  9  ^2  —     Xq  -\-  4cX^-{-  S. 

2x^+    Sxy+    6  2/2  +  14  a:  +  12  3/  -    4, 
7  x^  +       ?/2  +    6  a;  -    4  3/, 

3ar2-    6x3/+    32/2-    5a:  +    7, 

5a,-2  +  20x1/  +  15z/2  +  35a:  +  30y  -  10. 
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15.  Geometric  Illustrations.  The  sets  of  n  constants  with  which 
we  had  to  deal  in  §§  12,  13  may,  provided  that  not  all  the  constants 
in  any  one  set  are  zero,  advantageously  be  regarded  as  the  homoge- 
neous coordinates  of  points  in  space  of  ^  —  1  dimensions.  It  will 
then  be  convenient  to  speak  of  the  linear  dependence  or  independ- 
ence of  these  points.  The  geometric  meaning  of  linear  dependence 
will  be  at  once  evident  from  the  following  theorems  for  the 
case  n  =  4:. 

Two  points  will  here  be  represented  by  two  sets  of  four  constants 

^1'  y^  ^v  ^1' 

^2'  ^2'  ^2'     2' 

which  will  be  linearly  dependent  when,  and  only  when,  they  are  pro- 
portional, that  is,  when  the  points  coincide.     Hence: 

Theorem  1.  Two  points  are  linearly  dependent  wTien^  and  only 
wJien^  they  coincide. 

If  we  have  -three  points  in  space,  P^  P^^  Pg,  whose  coordinates 
are  {x^,  y^,  ^i,  t^),  {x^,  y^,  z^,  t^),  {x^,  j/g,  z^,  t^),  respectively,  and 
which  are  linearly  dependent,  there  must  exist  three  constants  Cj,  c^, 
Cg,  not  all  zero,  such  that 

6-1  ^-i     ~|—    CffXfy    ~\~    CnXn     V-', 

^1^1 +  ^2^2 +  ^3^3  =  ^1 
^1^1  ~l~  ^2^2  "^  ^3^3  ^^  ^' 
^1  ^1    "^~  ^2  ^2   "'"  ^3  ^3    ~  ^* 


Let  us  suppose  the  order  of  the  points  to  be  so  taken  that  c^  =?bO,  and 

Xn    -—   /l^l  X-i    "]       li/nXn^ 

yz  =  Ky\-^Kyv 

Z^  =  Ic^Z-^  -{-  /u2^2' 
Cg  =z  Ic^t-^   -f-  /u2 ^2? 


solve  for  x^,  y^,  z^,  t^i 

(1) 


where  ^^  =  —  e^/c^^  h^—  —  cjc^.     Now  if 

Ax^By^-Oz^Bt^^ 

is  the  equation  of  any  plane  through  the  points  F^  and  Pg^  ^^  have 

Ax^  -h  By^  4-  Cz^  -h  Bt^  =  0, 
Ax^  -h  P^2  +  ^^2  +  ^h  =  ^- 
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Multiplying  the  first  of  these  equations  by  h^^  the  second  by  k^^^  and 
adding,  we  have,  by  means  of  the  equations  (1), 

Ax^  +  By^  4-  Cz^  +  Dt^  =  0. 

Hence  every  plane  through  P^  and  P^  passes  through  Pg  also,  and 
the  three  points  are  collinear. 

Now,  in  order  to  prove  conversely  that  any  three  collinear  points 
are  linearly  dependent,  let  us  suppose  the  three  points  P^,  P^^  Pg 
collinear.  We  may  assume  that  these  three  points  are  distinct,  as 
otherwise  their  linear  dependence  would  follow  from  Theorem  1. 
We  have  seen  that  when  three  points  are  linearly  dependent,  the  line 
through  two  of  them  contains  the  third.     Hence  if  we  let 


2^2 


x'  =  k^x^  +  k 

y'  =  hvi  +  hvv 

Z    =  «'i2'i  ~r"  ^2^2' 
t    =  A^jCj  H"  ^2  2' 

where  k-^  and  k^  are  two  constants,  not  both  zero,  the  point  {x\  y\ 
2',  ^')  or  P^  lies  on  the  line  P^P^^  and  our  theorem  will  be  established 
if  we  can  show  that  the  constants  k-^  and  k^  can  be  so  chosen  that 
the  points  P'  and  Pg  coincide.  Now  let  ax  -{-by  -\-  cz  +  dt=0  be  the 
equation  of  any  plane  through  the  point  Pg  but  not  through  P^  or  Pg. 
Thus  Pg  is  determined  as  the  intersection  of  this  plane  with  the  line 
PiP^i  so  that  if  P^  which  we  know  lies  on  P1P2,  can  be  made  to  lie 
in  this  plane,  it  must  coincide  with  Pg  and  the  proof  is  complete. 
The  condition  for  P'  to  lie  in  this  plane  is  ax'  +  hy'  +  cz'  +  dt'  =  0. 
Substituting  for  x\  y\  z\  t'  their  values  given  above,  we  have 

k-^{ax-^  +  hy-^  +  cz^  +  dt-^  +  k^ax^  +  hy^  +  cz^  +  dt^  =  0. 

But  neither  of  these  parentheses  is  zero,  since  the  plane  does  not  pass 
through  Pj  or  Pg,  hence  we  may  give  to  k^  and  k^^  values  different 
from  zero  for  which  this  equation  is  satisfied.     We  have  thus  proved 

Theorem   2.    Three  points  are  linearly  dependent  when^  and  only 
when,  they  are  collinear. 

The  proofs  of  the  following  theorems  are  left  to  be  supplied  by 
the  reader.     It  will  be  found  that  some  of  them  are  readily  proved 
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from  the  definition  of  linear  dependence,  as  above,  while  for  others 
it  is  more  convenient  to  use  the  condition  for  linear  dependence  ob- 
tained in  §  13. 

Theorem  3.  Four  points  are  linearly  dependent  when^  and  only 
when,  they  are  complanar. 

Theorem  4.    Five  or  more  points  are  always  linearly  dependent. 

Another  geometric  application  is  suggested  by  the  following  con- 
siderations: 

A  set  of  n  ordinary  *  quantities  is  nothing  more  nor  less  than  a 
complex  quantity  with  n  components  (cf.  §  21).  Our  first  definition  of 
linear  dependence  is  therefore  precisely  equivalent  to  the  following: 

The  m  complex  quantities 

are  said  to  he  linearly  dependent  if  m  ordinary  quantities  c^^  c^,  •••  c^^ 
not  all  zero,  exist  such  that : 

C^a^  ~r  ^2^2    •      "*    ~i"  ^m^m  ^^  "• 

Now  the  simplest  geometric  interpretation  for  a  complex  quantity 
with  n  components  is  as  a  vector  in  space  of  n  dimensions,!  and  we 
are  thus  led  to  the  conception  of  linear  dependence  of  vectors.  The 
geometric  meaning  of  this  linear  dependence  will  be  seen  from  the 
following  theorems  for  the  case  n  =  S: 

^  Theorem  5.    Two  vectors  are  linearly  dependent  when,  and  only 
when,  they  are  collinear. 

Theorem  6.  Three  vectors  are  linearly  dependent  when,  and  only 
when,  they  are  complanar. 

Theorem.  7.    Four  or  more  vectors  are  always  linearly  dependent. 

In  order  to  get  a  geometric  interpretation  of  the  linear  dependence 
of  polynomials,  we  must  consider,  not  the  polynomials  themselves, 
but  the  equations  obtained  by  equating  them  to  zero.  We  speak  of 
these  equations  as  being  linearly  dependent  if  the  polynomials  are 

*  Two  different  standpoints  are  here  possible  according  as  we  understand  the  term 
ordinal^  quantity  to  mean  real  quantity^  or  ordinary  complex  quantity. 

t  There  are  of  course  other  possible  geometric  interpretations.  Thus  in  the  case 
n=  4  we  may  regard  our  complex  quantities  as  quaternions,  and  consider  the  meaning 
of  linear  dependence  of  two,  three,  or  four  quaternions. 
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linearly  dependent.  If  then  we  regard  the  independent  variables  as 
rectangular  coordinates,  these  equations  give  us  geometric  loci  in 
space  of  as  many  dimensions  as  there  are  independent  variables. 
Thus,  in  the  cases  of  two  and  three  variables,  we  have  plane  curves 
and  surfaces  respectively.  The  case  of  two  loci  is  of  no  interest, 
as  they  must  coincide  in  order  to  be  linearly  dependent.  In  the  case 
of  three  linearly  dependent  loci  it  is  easily  shown  that  any  one  must 
meet  the  other  two  in  all  their  common  points  and  in  no  others. 
The  following  theorems  will  serve  to  illustrate  the  geometric  mean- 
ing of  linear  dependence  : 

(1)  In  the  plane  : 

Theorem  8.  Three  circles  are  linearly  dependent  when^  and  only 
when^  they  belong  to  the  same  coaxial  family. 

Theorem  9.  Four  circles  are  linearly  dependent  when,  and  only 
when,  they  have  a  (real  or  imaginary)  coynmon  orthogonal  circle. 

Theorem  10.  Four  circles  are  linearly  dependent  when,  and  only 
when,  the  points  of  intersection  of  the  first  and  second,  and  the  points  of 
intersection  of  the  third  and  fourth,  lie  on  a  common  circle. 

Theorem  11.    Five  or  more  circles  are  always  linearly  dependent. 

(2)  In  space  (using  homogeneous  coordinates)  : 

Theorem  12.  Three  planes  are  linearly  dependent  when,  and  only 
when,  they  intersect  in  a  line. 

Theorem  13.  Four  planes  are  linearly  dependent  when,  and  only 
when,  they  intersect  in  a  point. 

Theorem  14.    Five  or  more  planes  are  always  linearly  dependent. 


CHAPTER   IV 

LINEAR   EQUATIONS 

16.  Non-homogeneous  Linear  Equations.  In  every  elementary 
treatment  of  determinants,  however  brief,  it  is  explained  how  to 
solve  by  determinants  a  system  of  n  equations  of  the  first  degree  in 
n  unknowns,  provided  that  the  determinant  of  the  coefficients  of 
the  unknowns  is  not  zero.  Cramer's  Rule,  by  which  this  is  done, 
is  this : 

Cramer's  Rule.     If  in  the  equations 


the  determinant 


^n\  *^l    r  •  •  •  ~r  (f'nn  ^n  —  ^tH 


a  — 


a 


11 


a 


m 


a 


m 


a 


nn 


is  not  zero^  the  equations  have  one  and  only  one  solution^  namely : 


tt-t                                ttn 
fy      _   ±.        /y       —  ^ 

^      a       ^      a 


^n  —  —  ' 

a 


where  a^  is  the  n-rowed  determinant  obtained  from  a  by  replacing  the 
elements  of  the  ith  column  by  the  elements  k-^^  k^-,  •  •  •  k^- 

This  rule,  whose  proof  we  assume  to  be  known,*  is  of  funda- 
mental importance  in  the  general  theory  of  linear  equations  to 
which  we  now  proceed. 

*  The  proof  as  given  in  most  English  and  American  text-books  merely  establishes 
the  fact  that  if  the  equations  have  a  solution  it  is  given  by  Cramer's  formulse.  That 
these  formulae  really  satisfy  the  equations  in  all  cases  is  not  commonly  proved,  but 
may  be  easily  estabished  by  direct  substitution.     We  leave  it  for  the  reader  to  do  this. 
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Consider  the  system  of  m  linear  equations  in  n  variables: 
^11^1+  •••  +«in^«  +  ^i  =  0, 


^ml^l  "i" 


•      ^mnP^nt^m  —  ^t 


where  m  and  n  may  be  any  positive  integers.     Three  cases  arise: 

(1)  The  equations  may  have  no  solution,  in  which  case  they  are 
said  to  be  inconsistent. 

(2)  They  may  have  just  one  solution. 

(3)  They  may  have  more  than  one  solution,  in  which  case  it  will  pres- 
ently appear  that  they  necessarily  have  an  infinite  number  of  solutions. 

Let  us  consider  the  two  matrices: 


a  = 


a 


11 


a 


\n 


a 


in\ 


a. 


b  = 


a 


11 


«in  ^1 


a 


m\ 


a 


mn  "m 


We  will  call  a  the  matrix  of   the  system  of   equations,  b  the  aug- 
mented matrix. 

It  is  evident  that  the  rank  of  the  matrix  a  cannot  be  greater 
than  that  of  the  matrix  b,  since  every  determinant  contained  in  a 
is  also  contained  in  b.     We  have,  then,  two  cases: 

I.  Rank  of  a  =  Rank  of  b. 

II.  Rank  of  a  <  Rank  of  b. 
We  will  consider  Case  II  first. 

Let  r  be  the  rank  of  b.  Then  b  must  contain  at  least  one 
r-rowed  determinant  which  is  not  zero.  Moreover,  this  determinant 
must  contain  a  column  of  b's,  since  otherwise  it  would  be  contained 
in  a  also,  which  is  contrary  to  our  hypothesis.  Suppose  for  definite- 
ness  that  this  non-vanishing  r-rowed  determinant  is  the  one  situ- 
ated in  the  upper  right-hand  corner  of  b.  There  is  no  loss  of 
generality  in  assuming  this,  since  by  writing  the  equations  in  a  dif- 
ferent order  and  changing  the  order  of  the  variables  x-^^  •••  x^  we 
can  always  bring  the  determinant  into  this  position.  Now  for 
brevity  let  us  represent  the  polynomials  forming  the  first  members 
of  our  given  equations  by  F^,  F^^  •••  F^  respectively,  and  the 
homogeneous  polynomials  obtained  by  omitting  the  constant  terms 
in  each  of  these  equations  by  /j,  /g,  •••  /;„.  Then  we  have  the 
identities : 


Fi=fi-\-h 


(^  =  l,  2,  ...  m). 
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Consider  the  first  r  of  these  identities.  Since  the  rank  of  a  is 
less  than  r,  the  polynomials  /j,  /g,  •  •  •  /^  are  linearly  dependent, 

hence  ^1-^1  H •"  ^r-^r  =  ^1^1+ h  <?r^r  =  ^• 

But  since  the  rank  of  b  is  r,  the  polynomials  F^^  "  F^  are  linearly 
independent  and  therefore  (7=^  0.  Hence  the  given  equations  are  in- 
consistent^ for  if  they  were  consistent  all  the  JP's  would  be  zero  for 
some  suitably  chosen  values  of  Xy,  •••  a;„,  and  if  we  substitute  these 
values  in  the  last  written  identity  we  should  have 

0=    (7:9^0. 

Let  us  now  consider  Case  I.  Let  r  be  the  common  rank  of  a 
and  b,  then  there  is  at  least  one  r-rowed  determinant  in  a  which 
is  not  zero.  This  same  determinant  also  occurs  in  b.  Suppose  it  to 
be  situated  in  the  upper  left-hand  corner  of  each  matrix.  Since  all 
(r  +  1) -rowed  determinants  of '^either  matrix  are  zero,  the  first  (r+1) 
of  the  J^'s  are  linearly  dependent,  and  we  have 

and,  since  F^  '-  F^  are  linearly  independent,  c^^-^  cannot  be  zero  ; 
hence  we  may  divide  through  by  it  and  express  F^^^  linearly  in 
terms  of  Fy,  •  •  •  F^.  The  same  argument  holds  if  instead  of  F^^-^  we 
take  Fr+2  or  any  other  one  of  the  remaining  ^'s.     Hence 

F,^,  =  k^i^F^-{--  +  k^i}F,  G  =  l,  2,  ...m-r). 

From  these  identities  it  is  obvious  that  at  any  point  {x^  ---x^  where 
Fy  '"  Fr  all  vanish,  the  remaining  ^'s  also  vanish.  In  other  words, 
any  solution  which  the  first  r  equations  of  the  given  system  may 
have  is  necessarily  a  solution  of  the  whole  system. 

Now  consider  the  first  r  of  the  given  equations.  Assign  to 
aJ^+i,  '  '  '  x^  any  fixed  values  a;J.+i,  -  -  -  x^^  and  transpose  all  the  terms 
after  the  rth  in  each  equation  to  the  second  member. 
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Remembering  that  the  right-hand  sides  of  these  equations  are  known 
constants,  and  that  the  determinant  of  the  coefficients  on  the  left  is 
not  zero,  we  see  that  we  have  the  case  to  which  Cramer's  Rule 
applies,  and  that  this  system  of  equations  has  therefore  just  one 
solution.  Hence  the  given  system  of  equations  is  consistent,  and  we 
have  the  theorem: 

Theorem  1.  A  necessary/  and  sufficient  condition  for  a  system  of 
linear  equations  to  be  consistent  is  that  the  matrix  of  the  systein 
have  the  same  rank  as  the  augmented  matrix. 

From  the  foregoing  considerations  we  have  also 

Theorem  2.  If  in  a  system  of  linear  equations  the  matrix  of 
the  system  and  the  augmented  matrix  have  the  same  rank  r,  the  values 
of  n  —  r  of  the  unknowns  may  he  assigned  at  pleasure  and  the  others 
will  then  he  uniquely  determined. 

The  n—  r  unknowns  whose  values  may  he  assigned  at  pleasure  may 
he  chosen  in  any  way  provided  that  the  matrix  of  the  coefficients  of 
the  remaining  unknowns  is  of  rank  r. 


EXERCISES 

Solve  completely  the  following  systems  of  equations: 


1. 


2x-    2/ +  32-    1  =  0, 

4a: -2^-    2+     3  =  0, 

2a:  -    y  -4:z  -\-    4  =  0, 

10a:-  53/-6S  +  10  =  0. 


2.  \ 


4a:—    y  -\-     2  +  5  =  0, 

2a:  -  3?/  +  52  +  1  =  0, 

x+    2/- 22 +  2  =  0, 

5a;         -2  +  2  =  0. 


3.  \ 


2a:  -  3?/  +  42  -     w;  =  3, 

X  -{-2y  -     2  +  2w;  =  1, 

3a:  -     y  -\-2z-  3w  =  4:, 

3a:—    y  +     z  -  7 w  =  4:. 
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17.    Homogeneous  Linear  Equations.      We  will  now  consider  the 
special  case  where  the  equations  of  the  last  section  are  homogeneous^ 

i.e.  where  all  the  5's  are  zero, 

* 


^m\^l    I     ***     \^mn-^n  —  ^* 


The  matrices  a  and  b  of  the  last  section  differ  here  only  by  a  column 
of  zeros  ;  hence  they  always  have  the  same  rank  and  this  is  called 
the  rank  of  the  system  of  equations.  Theorems  1  and  2  of  the  last 
section  become 

Theorem  1.  A  system  of  homogeneous  linear  equations  always 
has  one  or  more  solutions. 

Theorem  2.  If  the  rank  of  a  system  of  homogeneous  linear  equa- 
tions in  n  variables  is  r,  the  values  of  n  —  r  of  the  unknowns  may  he 
assigned  at  pleasure  and  the  others  will  then  he  uniquely  determined.* 

If  the  rank  of  the  equations  is  w,  there  will  therefore  be  only  one 
solution,  and  this  solution  is  obviously  x^  =  X2=  •••=  o:^  =  0.  Since 
the  rank  can  never  be  greater  than  n^  we  have 

Theorem  3.  A  necessary  and  sufficient  condition  for  a  system  of 
homogeneous  linear  equations  in  the  n  variables  (iCj,  •••  x^)  to  have  a  solu- 
tion other  than  Xi  =  X2  =  ■••  =  x^  =  0  is  that  their  rank  he  less  than  n. 

Corollary  1.  If  there  are  fewer  equations  than  unknowns^  the 
equations  always  have  solutions  other  than  x-^=  x^  =  •••  =  a;^  =  0. 

Corollary  2.  If  the  number  of  equations  is  equal  to  the  number 
of  unknowns.,  a  necessary  and  sufficient  condition  for  solutions  other  than 
x-^  =  x^=  ••'  =  Xn=  0  is  that  the  determinant  of  the  coefficients  he  zero. 

In  the  special  case  where  the  number  of  equations  is  just  one  less 
than  the  number  of  unknowns  and  the  equations  are  linearly 
independent,  we  will  prove  the  following : 

Theorem  4.  Every  set  of  values  of  a;^, •••a;^  which  satisfies  a 
system  of  n  —  1  linearly  independent^^  homogeneous  linear  equations  in 

*  Cf.  also  the  closing  lines  of  Theorem  2,  §  16. 

t  The  theorem  is  still  true  if  the  equations  are  linearly  dependent,  but  it  is  then 
trivial,  since  the  determinants  in  question  are  all  zero. 
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n  unknowns  is  proportional  to  the  set  of  (n  —  lyrowed  determinants 
taken  alternately  with  plus  and  minus  signs,  and  obtained  by  striking 
out  from  the  matrix  of  the  coefficients  first  the  first  column,  then  the 
second,  etc. 

Let  us  denote  by  ai  the  {n  —  l)-rowed  determinant  obtained  by 
striking  out  the  ^th  column  from  the  matrix  of  the  equations.  Since 
the  equations  are  linearly  independent,  there  must  be  at  least  one 
of  the  determinants  a-^,  a^,  •••  a^  which  is  not  zero.  Let  it  be  a^. 
Now  assign  to  x^  any  fixed  value,  c,  and  transpose  the  ^th  term  of 
each  equation  to  the  second  member  and  we  have 

Hence  :  x,^  =  ^ ^ — '■ — '■ — -  (k=l,  2,  •  •  •  n), 

a^ 

from  which  it  is  clear  that  (x^,  •••  x^)  are  proportional  to  the  de- 
terminants (^j,  —  ^2?  «3i  •••  (  —  l)"'"!^^),  as  was  to  be  proved. 

The  theory  of  homogeneous  linear  equations  has  here  been  de- 
duced from  the  theory  of  linear  dependence.  It  can,  however,  in 
turn  be  used  to  obtain  further  results  in  this  last-mentioned  theory. 
As  an  example  of  this  we  will  deduce  the  following  theorem,  which 
we  shall  find  useful  later : 

Theorem  5.  If  a  set  of  points  (x^,  •••  a;„),  finite  or  infinite  in 
number,  have  the  property  that  k  points  can  be  found  among  them  upon 
which  every  other  point  of  the  set  is  linearly  dependent,  then  any  k-^X 
points  of  the  set  will  be  linearly  dependent. 

Let  {x\,  •••  xl),  (x'-l,  •••  xH),  •••  (x^f\  '"  x^J^^)  be  the  k  points  upon 
which  every  other  point  of  the  set  is  linearly  dependent,  and  let 

(X[,  ...  X'„),  (X'l,  ...  X':), ...  (xr", ...  XJ.*«]) 

be  any  k-\-l  points  of  the  set.     Then  we  may  write 

'Xi^^  =  cfl  x[  +  ct| x'l-{-'---\-  4^JxW , 
(1)  ^ (*  =  1,  2,  --k^l) 

.xii^=c^fx',+c^x':  +  .:  +  c^^xi^\ 
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This  is  true  by  hypothesis  if  (Xf^,  •••  XJ'"^)  is  not  one  of  the  first  k 
points,  and  if  it  is  one  of  these  points,  it  is  obviously  true.  We 
have  then  to  prove  that  k  -\-\  constants,  (7j,  63,  •  •  •  (7^+i,  not  all  zero, 
can  be  found  such  that 


(7iX;.  +  c^x"j  +  -  +  ci+iXf  ^"  =  0 


(y  =  i,  2,  ...n). 


By  substituting  here  the  values  of  the  X's  from  (1),  we  see  that 
these  equations  will  be  fulfilled  if 


and  this  is  a  system  of  fewer  equations  than  unknowns,  which  is 
therefore  satisfied  by  a  set  of  (7's  not  all  zero.  (Cf.  Theorem  3, 
Cor.  1.) 

EXERCISES 

Solve  completely  the  following  systems  of  equations : 


1. 


2. 


'  11  X  +  8  2/  -  2  2  +  3  z^  =  0, 
2x  -\-  3y  —  z  -\-  2iv  =  0, 
7  X—  y  -{-  z  -  3«'  =  0, 
4x-ll2/  +  5s-12wj  =  0. 

2x-  32/4-52+  3zy  =  0, 
4  a;  —  2/  +  2  +  w;  =  0, 
3a:-    22/  +  32+    ^w  =0. 


18.   Fundamental   Systems    of   Solutions   of   Homogeneous   Linear 
Equations.     If  {x[^'"x'n)  is  a  solution  of  the  system  of  equations 


(1) 


^11  "^l  "T"  *  *  ■  'T"  ^in-^n  —  ^1 


^ml^l  +  •  •  •  +  ^7nn^r, 


0, 


then  {cx[^---cXn)  is  also  a  solution,  and  by  giving  to  c  different  values 
we  get  thus  (except  in  the  special  case  in  which  the  x"s  are  all  zero) 
an  infinite  number  of  solutions.  These  may  include  all  the  solutions 
of  (1)  (cf.  Theorem  4  of  the  last  section),  but  in  general  this  will 
not  be  the  case. 
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Suppose,  again,  that  (x[,  •••  xl)  and  (x^^,  •  ••  x'l)  are  two  solutions  of 
(1),  then  {c^x[  +  c^x'l,  •  •  •  e^x^  +  c^x'^)  is  also  a  solution.  If  the  two 
given  solutions  are  proportional  to  each  other,  this  clearly  gives  us 
nothing  more  than  what  we  had  above  by  starting  from  a  single 
solution  ;  but  if  these  two  solutions  are  linearly  independent,  we 
build  up  from  them,  by  allowing  e^  and  c^  to  take  on  all  values,  a 
doubly  infinite  system  of  solutions ;  but  even  this  system  will 
usually  not  include  all  the  solutions  of  (1).  Similarly  we  see 
that,  if  we  can  find  three  linearly  independent  solutions,  we  can 
build  up  from  them  a  triply  infinite  system  of  solutions,  etc.  If, 
proceeding  in  this  way,  Ave  succeed  in  finding  a  finite  number  of 
linearly  independent  solutions  in  terms  of  which  all  solutions  can 
be  expressed,  this  finite  number  of  solutions  is  said  to  form  a 
fundamental  system. 

Definition.  If  (a;fj,  •••  x^'J)  (^=  1,  2,  •••  A")  are  a  system  of  h 
solutions  o/'(l)  which  satisfy  the  following  two  conditions^  they  are  said 
to  form  a  fundamental  system: 

(a)  They  shall  he  linearly  independent. 

(b)  Every  solution  of  (\)  shall  he  expressible  in  the  form 

{c^x[^-c^x'l  H Vcj,x^l\  c^xl  +  e^xl-\ \-Cj^^^^). 

Theorem  1.  If  the  equations  (1)  are  of  rank  r  <n^  they  possess 
an  infinite  number  of  fundamental  systems  each  of  which  consists 
of  n  —  r  solutions. 

Suppose  the  r-rowed  determinant  which  stands  in  the  upper  left- 
hand  corner  of  the  matrix  of  the  equations  (1)  does  not  vanish,  and 
let  us  consider  the  first  r  of  these  equations.  Any  solution  of  these 
will  be  a  solution  of  all  the  others.  Transpose  all  terms  after  the 
rth  to  the  second  members,  and  let  (a^^+j,  •••  x^)  have  any  fixed  set 
of  values  (a;J.+2,  •••  a;J),  not  all  zero;  then  these  r  equations  will  have 
just  one  solution  given  by  Cramer's  Rule.  Call  it  {x[,  «••  x^,). 
Now  let  {Xr+i^  •'•  x^)  have  any  other  fixed  set  of  values  {xJl-^,  •••  a;JJ), 
not  all  zero,  and  we  get  another  solution,  (a;'/,  •••  x'^)-  Continue  in 
this  way  until  we  have  n  —  r  solutions 

/yf  •  •  •       ^  ^y  ...         o^ 
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If  we  have  chosen  these  n  —  r  sets  of  values  for  (a^^+j, 
the  determinant  j  ^i 
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Xn)  SO  that 


(2) 


X 


r+\ 


Xn 


is  not  zero,  —  and  this  may  clearly  be  done  in  an  infinite  variety  of 
ways,  —  these  n  —  r  solutions  will  be  linearly  independent.  That  is 
to  say,  we  may  thus  obtain  an  infinite  number  of  sets  oi  n  —  r  solu- 
tions each,  each  of  which  satisfies  condition  (a)  of  our  definition  for 
a  fundamental  system. 

To  prove  that  these  sets  of  solutions  also  satisfy  condition  (6),  let 
us  suppose  that  (Xj,  •••  X^)  is  any  solution  of  the  r  equations  we  are 
considering.  The  last  n  —  r  oi  these  JTs  are  linearly  dependent  on 
the  n—r  sets  of  values  we  have  chosen  for  (a;^+i,  •  •  •  x^)  since  we  have 
here  more  sets  of  constants  than  there  are  elements  in  each  set  (cf. 
Theorem  2,  §13),  and  the  determinant  (2)  is  not  zero.     Thus 


(3)      Xi  =  c^x'i  +-  c^x'l  + . . .  +  c^.^^r'^ 


(z  =  r  +  1,  /•  +  2,  . .  •  n). 


Let  us  now  solve  the  first  r  equations  (1)  by  Cramer's  Rule,  regard- 
ing rc^+-^,  •••  x^^  as  known.     We  thus  get  results  of  the  form 


x„ 


—  ■^j'^r+l  "I"  -^j  '^r+2  H~  *  "  "^  -^j        "^Ti 


By  assigning  special  values  here  to  x. 


r+V 


x^,  we  get 


r). 


(4) 


X. 


'-  A'-' 


j^r+l 


'  j^r+2       I"  *  *  ■  "T  ^j         ^n? 


ry,[n-r] A  /^[n-r]    i      Al/^ln—r]  _i_  . . .  _i_    A[n—r]^[n-r] 


(y  =  l,  2, -r) 


^r+1 


r+2 


re- 


If  we  multiply  the  first  7i  —  r  oi  these  equations  by  <?j,  •••  c^,^^ 
spectively  and  add,  we  get,  by  (3), 

cj^.  +  . . .  -f  ^^_^|«-^^  =  A'jX,^^  +  •"  +  Af-'^X^. 

Consequently,  by  the  last  equation  (4), 
(5)  Xj  =  c^x'j  -f  .  • .  e,.,xlr^-^^  (j  =  1,  2,  •  •  •  r> 

Equations  (3)  and  (5)  together  prove  our  theorem. 
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We  thus  see  that  the  totality  of  all  solutions  of  the  system  (1) 
forms  a  set  of  points  satisfying  the  conditions  of  Theorem  5,  §  17. 
Consequently, 

Theorem  2.  If  the  rank  of  a  system  of  homogeneous  linear  equa- 
tions in  n  variables  is  r,  then  any  ti  —  r  -h  1  solutions  are  linearly 
dependent. 

Finally  we  will  prove  the  theorem: 

Theorem  3.  A  necessary  and  sufficient  condition  that  a  set  of 
solutions  of  a  system  of  homogeneous  linear  equations  of  rank  r  in  n 
variables  form  a  fundamental  system  is  that  they  be 

{a)  linearly  independent^ 
(b)  n  —  r  in  number. 

By  definition,  (a)  is  a  necessary  condition.  To  see  that  (5)  also 
is  necessary,  notice  that  by  Theorem  2  there  cannot  be  more 
than  n  —  r  linearly  independent  solutions.  We  have,  then, 
merely  to  show  that  I  linearly  independent  solutions  never  form 
a  fundamental  system  when  l<  n  —  r.  If  they  did,  then  by 
Theorem  5,  §  17,  any  set  of  Z  +  1  solutions  would  be  linearly 
dependent,  and  therefore  the  same  would  be  true  of  any  set  of 
n  —  r  solutions  (since  n  —  r^l  -\-  \).  But  by  Theorem  1,  this 
is  not  true. 

In  order  now  to  prove  that  conditions  (a)  and  {b)  are  also 
sufficient,  let 

{xf,  xl\  ...  x^^)  (i  =  1,  2,  ...  n  -  r) 

be  any  system  oi  n  —  r  linearly  independent  solutions  of  our  system 
of  equations,  and  let  (a^j,  ...  x^)  be  any  solution  of  the  system. 
Then,  by  Theorem  2,  we  have  n  —  r  -\-  1  constants  (c^,  ...  c„_^+i), 
not  all  zero,  and  such  that 

c^x^  +  e^x^ -{-  ...  -hc^_rx}''-''^-{-c^_r+iXj  =  0  (y=l,  2,  ...  n). 

But  since  the  n  —  r  given  points  are  linearly  independent,  c^.^  +  j  ^  0; 
accordingly  these  last  equations  enable  us  to  express  the  solu- 
tion (a^i,  ...  x^)  linearly  in  terms  of  the  n  —  r  given  solutions, 
and   this   shows   that   these   n  —  r   solutions   form   a   fundamental 

svstem. 
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EXERCISES 

1.  Prove  that  all  the  fundamental  systems  of  solutions  of  a  system  of  homo- 
geneous linear  equations  are  included  in  the  infinite  number  obtained  in  the 
proof  of  Theorem  1. 

2.  Given  three  planes  in  space  by  their  equations  in  homogeneous  coordinates. 
What  are  their  relative  positions  when  the  rank  of  the  system  of  equations  is  3  ? 
when  it  is  2  ?  when  it  is  1  ? 

3.  Given  three  planes  in  space  by  their  equations  in  non-homogeneous  coordi- 
nates. What  are  their  relative  positions  for  the  dilferent  possible  pairs  of  values 
of  the  ranks  of  the  matrices  and  augmented  matrices  ? 


CHAPTER  V 


SOME   THEOREMS   CONCERNING   THE   RANK   OF   A   MATRIX 

19.  General  Matrices.  In  order  to  show  that  a  given  matrix 
is  of  rank  r,  we  have  first  to  show  that  at  least  one  r-rowed  deter- 
minant of  the  matrix  is  not  zero,  and  secondly  that  all  (r  -f  1)- 
rowed  determinants  are  zero.  This  latter  work  may  be  considerably 
shortened  by  the  following  theorem  : 

Theorem  1.  If  in  a  given  matrix  a  certain  r-rowed  determinant 
is  not  zero,  and  all  the  {r-\-l)-rowed  determinants  of  which  this  r-rowed 
determinant  is  a  first  minor  are  zero^  then  all  the  {r-\-iyrowed  deter- 
minants of  the  matrix  are  zero. 

We  will  assume,  as  we  may  do  without  loss  of  generality,  that 
the  non-vanishing  r-rowed  determinant  stands  in  the  upper  left- 
hand  corner  of  the  matrix.     Let  the  matrix  be 


a 


11 


a 


m 


a 


ml 


a„ 


and  consider  the  r -\-\  sets  of  n  quantities  each  which  lie  in  the  first 
r  -f- 1  rows  of  this  matrix.  These  r  -h  1  sets  of  quantities  are  linearly 
dependent,  as  will  be  seen  by  reference  to  the  proof  of  Theorem  1, 
§13,  for  although  we  knew  there  that  all  the  (r -h  l)-rowed  deter- 
minants were  zero,  we  made  use  of  this  fact  only  for  those  (r-|-l)- 
rowed  determinants  which  we  now  assume  to  be  zero.  Moreover, 
since  the  r  sets  of  constants  which  stand  in  the  first  r  rows  of  our 
matrix  are  linearly  independent,  it  follows  that  the  {r  +  l)th  row  is 
linearly  dependent  on  the  first  r.  Precisely  the  same  reasoning 
shows  that  each  of  the  subsequent  rows  is  linearly  dependent  on  the 
first  r  rows.  Accordingly,  by  Theorem  5,  §17,  any  r-f  1  rows  are 
linearly  dependent ;  and  therefore,  by  Theorem  1,  §13,  all  the  (r-f  1)- 
rowed  determinants  of  our  matrix  are  zero,  as  was  to  be  proved. 
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Still  another  method  of  facilitating  the  determination  of  the  rank 
of  a  matrix  is  by  changing  the  form  of  the  matrix  in  certain  ways 
which  do  not  change  its  rank.  In  order  to  explain  this  method,  we 
begin  by  laying  down  the  following  definition  : 

Definition  1.  By  an  elementary  transformation  of  a  matrix  we 
understand  a  transformation  of  any  one  of  the  following  forms  : 

(a)  the  interchange  of  two  rows  or  of  two  columns  ; 

(b)  the  multiplication  of  each  element  of  a  row  {or  column)  by  the 
same  constant  not  zero  ; 

(c)  the  addition  to  the  elements  of  one  row  {or  column)  of  the  prod- 
ucts of  the  corresponding  elements  of  another  row  {or  column)  by  one  and 
the  same  constant. 

It  is  clear  that  if  we  can  pass  from  a  matrix  a  to  a  matrix  b  by  one 
of  these  transformations,  we  can  pass  back  from  b  to  a  by  an  elemen- 
tary transformation. 

Definition  2.  Two  matrices  are  said  to  be  equivalent  if  it  is  possi- 
ble to  pass  from  Qne  to  the  other  by  a  finite  number  of  elementary  trans- 
formations. 

Theorem  2.    If  two  matrices  are  equivalent.,they  have  the  same  rank. 

It  is  evident  that  the  transformations  {a)  and  (5)  of  Definition  1 
do  not  change  the  rank  of  a  matrix,  since  they  do  not  affect  the  van- 
ishing or  non-vanishing  of  any  determinant  of  the  matrix.  In  order 
to  prove  our  theorem,  it  is  therefore  sufficient  to  prove  that  the  rank 
of  a  matrix  is  not  changed  by  a  transformation  {c). 

Suppose  this  transformation  consists  in  adding  to  the  elements  of 
the  |?th  row  of  a  matrix  a  h  times  the  elements  of  the  ^th  row, 
thus  giving  the  matrix  b.  Let  r  be  the  rank  of  the  matrix  a.  We 
will  first  show  that  this  rank  cannot  be  increased  by  the  transforma- 
tion, that  is,  that  all  (r  -f  l)-rowed  determinants  of  the  matrix  b  are 
zero.  By  hypothesis  all  the  (rH-l)-rowed  determinants  of  the 
matrix  a  are  zero,  and  some  of  these  determinants  are  clearly  not 
changed  by  the  transformation,  namely,  those  which  do  not  contain 
the  jt?th  row,  or  which  contain  both  the  ^th  and  the  gth  row.  The 
other  determinants,  which  contain  the  p\h  row  but  not  the  ^th,  take 
on  after  the  transformation  the  form  ^4  ±  A^-B  where  A  and  B  are 
{r  -\-  l)-rowed  determinants  of  a,  and  are  therefore  zero.  Thus  we 
see  that  the  transformation  {c)  never  increases  the  rank  of  a  matrix. 
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Moreover,  the  rank  of  b  cannot  be  less  than  that  of  a,  for  then  the 
transformation  (c?)  which  carries  b  into  a  would  increase  the  rank  of 
b,  and  this  we  have  just  seen  is  impossible. 

This  theorem  can  often  be  used  to  advantage  in  determining  the 
rank  of  a  matrix,  for  by  means  of  elementary  transformations  it  is 
often  easy  to  simplify  the  matrix  very  materially. 


EXERCISES 


Determine  the  ranks  of  the  following  matrices : 


1. 


2. 


14 

12 

6    8    2 

6 

104 

21    9   17 

7 

6 

3   4    1 

35 

30 

15   20    5 

75 

0 

116  -  39 

0 

171 

-  69 

402    123 

45 

301 

0 

87  -417 

-169 

114 

-  46 

268    82 

30 

3.  Prove  that  any  matrix  of  rank  r  can  be  reduced  by  means  of  elementary 
transformations  to  a  form  where  the  element  iii  the  ith  row  and  I'th  column  is  1 
when  i<r,  while  all  the  other  elements  of  the  matrix  are  zero. 

4.  Hence  prove  that  two  matrices  with  m  rows  and  n  columns  each  are  always 
equivalent  when  they  have  the  same  rank. 

5.  Prove  that  a  necessary  and  sufficient  condition  that  the  matrix 


ail 


«i« 


^/nl    *  •  *    (^m 

be  of  rank  0  or  1  is  that  there  exist  m  +  n  constants  ai,  •••  a,„,  ySi,  •••  /?„   such  that 

20.   Symmetrical  Matrices. 
Definition.     The  square  matrix 


a  = 


{and  also  its  determinant)  is  said  to  he  symmetrical  if  the  pairs  of  terms 
which  are  situated  symmetrically  with  respect  to  the  principal  diagonal 
are  equal.      That  is^  if  a^j  =  a^-i. 


«11 

«12-- 

'  ^In 

«21 

^22  " 

•  ^2n 

• 

•        • 

• 

• 

• 

* 

^nl 

^n2 

^nn 
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We  will  denote  by  M^  an  2-rowed  principal  minor  of  a.  It  is 
our  main  object  in  this  section  to  show  how  the  rank  of  the  symmet- 
rical matrix  may  be  determined  by  an  examination  of  the  principal 
minors  only.  This  may  be  done  by  means  of  the  following  three 
theorems. 

Theorem  1.  If  an  r- rowed  principal  minor  Mr  of  the  symmetrical 
matrix  a  is  not  zero^  while  all  the  principal  minors  obtained  by  adding 
one  row  and  the  same  column^  and  also  all  those  obtained  by  adding  two 
rows  and  the  same  two  columns^  to  M^  are  zero^  then  the  rank  of  a  is  r. 

Let  the  non-vanishing  minor  be  the  one  which  stands  in  the  upper 
left-hand  corner  of  a,  and  let  B^^  denote  the  determinant  obtained  by 
adding  the  ath  row  and  the  /3th  column  to  M^.  If  we  can  show  that 
B^^  =  0  for  all  unequal  values  of  «  and  yS  our  theorem  will  be  proved. 
Cf.  Theorem  1,  §  19.  Give  to  the  integers  a  and  yS  any  two  unequal 
values,  and  let  0  denote  the  determinant  obtained  by  adding  to  M^.  the 
ath  and  /Sth  rows  and  the  ath  and  /8th  columns  of  a.  Then  we  have, 
by  hypothesis,  M^^  0,  BJ=  0,  B^^  =  0,  C=  0.  Let  Ml  be  the  two- 
rowed  principal  minor  of  the  adjoint  of  0  which  corresponds  to  the 
complement  of  Mr  in  (7.     Then  by  Corollary  3,  §  11,  we  have 

Mi=OM,=0, 
But  Mi  =  B^^B,^-Bl,, 

Therefore  ^a8  =  ^- 

Theorem  2.  If  all  the  (r-\-iyrowed  principal  minors  of  the  sym- 
metrical matrix  a  are  zero,  and  also  all  the  {r-t  2) -rowed  principal 
minors,  then  the  rank  of  a  is  r  or  less. 

If  r  =  0,  all  the  elements  in  the  principal  diagonal  are  zero  and  all 
the  two-rowed  principal  minors  are  zero. 

That  is,  %  •  ajj  —  a  |-  =  0, 

and  therefore,  since  %  =  a^y  =  0,  a^y  =  0.  That  is,  every  element  is 
zero  and  hence  the  rank  is  zero,  and  the  theorem  is  true  in  this 
special  case. 

Now,  assume  it  true  when  r  =  Jc;  that  is,  we  assume  that  when  all 
(A?-f  l)-rowed  principal  minors  are  zero  and  all  (A?  +  2)-rowed  principal 
minors  are  zero,  the  rank  of  a  is  less  than  k-\-l.  Then  it  follows  that 
when  all  (A;-|-2)-rowed,  and  all  (^-|-3)-ro wed  principal  minors  are  zero. 
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the  rank  of  a  is  less  than  k-\-2.  For  in  this  case,  if  all  {k  -f-  l)-rowed 
principal  minors  are  zero,  the  rank  is  less  than  A;  + 1,  by  hypothesis, 
and  if  some  (^4-l)-rowed  principal  minor  is  not  zero,  the  rank  is  ex- 
actly k  -\-l,  by  the  last  theorem.  We  see  then  that  if  the  theorem  is 
true  for  r=:k  it  is  true  for  r  =  A;  -f  1.  But  we  have  proved  it  true  for 
r  =  0,  hence  it  is  true  for  all  values  of  r. 

Theorem  3.  If  the  rank  of  the  symmetrical  matrix  a.  is  r>  0,  there 
is  at  least  one  r-rowed  principal  minor  of  a  which  is  not  zero. 

For  all  (r  +  l)-rowed  principal  minors  are  zero,  and,  if  all  r-rowed 
principal  minors  were  zero  also,  the  rank  of  a  would  be  r— 1  or  less, 
by  the  last  theorem. 

We  close  with  a  theorem  of  a  somewhat  special  character  which 
will  be  found  useful  later  (cf .  Exercises  4-6,  §  50). 

Theorem  4.  If  the  rank  of  the  symmetrical  matrix  a.  is  r  >  0,  we 
may  shift  the  rows  {at  the  same  time  shifting  the  columns  in  the  same 
way^  thus  keeping  a  symmetrical)  in  such  a  way  that  no  consecutive  two 
of  the  set  of  quantities       jy[    jj^    j^         jj^ 

shall  he  zero  and  M^  ^  0  ;  M^  being  unity ^  and  the  other  M^s  being  the 
principal  minors  of  a.  of  orders  indicated  by  their  subscripts,  which  stand 
in  the  upper  left-hayid  corner  of  a  after  the  shifting. 

By  definition  we  have  M^^  ^  0.  Leaving  aside  for  the  moment 
the  special  case  in  which  all  the  elements  of  the  principal  diagonal  are 
zero,  let  us  suppose  the  element  %  is  not  zero.  Then  by  shifting  the 
^th  row  and  column  to  the  first  place,  we  have  M^  =^  0.  We  have 
thus  fixed  the  first  row  and  column,  but  we  are  still  at  liberty  to 
shift  all  the  others.  Now  consider  the  two-rowed  principal  minor 
obtained  by  adding  to  M^  one  row  and  the  same  column.  Leaving 
aside  still  the  special  case  in  which  these  are  all  zero,  let  us  suppose 
that  the  two-rowed  determinant  obtained  by  striking  out  all  the  rows 
and  columns  except  those  numbered  1  and  i^  is  not  zero.  Then,  by 
shifting  the  i^ih.  row  and  column  into  the  second  place,  we  have 
71^2  =^  0.  We  next  have  to  consider  the  three-rowed  principal  minors 
of  which  M^  is  a  first  minor.  We  can  evidently  proceed  in  this  way 
until  we  have  so  shifted  our  rows  and  columns  that  none  of  the  quan- 
tities Mq.,  My^  ...  Mr  are  zero,  unless  at  a  certain  stage  we  find  that 
all  the  principal  minors  of  a  certain  order  which  we  have  to  consider 
are  zero.  In  this  case  we  should  have«o  shifted  our  first  k  rows  and 
columns  that  none  of  the  quantities  M^,  M^,  ...  Mj^.  are  zero,  but  we 
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should  then  find  that  all  (A:+l)-rowed  principal  minors  of  which  Mj^ 
is  a  first  minor  vanish,  so  that,  however  we  may  shift  the  last  n^k 
rows  and  columns,  we  have  illf^+i  =  0.  Let  us  then  examine  the 
(A;  +  2)-rowed  principal  minors  of  which  M^.  is  a  second  minor.* 
These  can  (by  Theorem  1)  not  all  be  zero  as  otherwise  the  rank  of  a 
would  be  A?  <  r.  That  is,  if  M^^-^  =  0,  we  can  so  arrange  the  rows 
and  columns  that  Mj^^^^^^.  Thus  we  see  that  the  rows  and  col- 
umns of  a  may  be  so  shifted  that  no  consecutive  two  of  the  M'^  are 
zero.  Now,  if  il^/J._i=0,  the  above  proof  shows  that  we  can  make 
Mr ^ 0.  But  even  though  Mj.-i^O  we  can  still  make  M^^O,  for  by 
hypothesis  f  all  the  determinants  obtained  by  adding  to  M^-i  two 
rows  and  the  same  two  columns  vanish,  and  if  all  those  obtained  by 
adding  one  row  and  the  same  column  were  zero  also,  the  rank  of  a 
would  be  r—  1,  by  Theorem  1. 

A  symmetrical  matrix  is  said  to  be  arranged  in  normal  form  when 
no  consecutive  two  of  the  ilf' s  of  Theorem  4  are  zero  and  Mr  ^  0. 


-      EXERCISES 

1.   Determine  the  ranks  of  the  following  matrices : 


2 

1 

11 

2 

0 

4   10   1 

1 

0 

4 

-1 

^ 

4 

8   18   7 

11 

4 

56 

5 

10 

18   40   17 

2 

-1 

5 

-G 

1 

7   17   3 

1 

0 

0 

1 

4 

0 

1        h          d 

0 

1 

0 

2 

5 

1 

0        c          e 

0 

0 

1 

3 

6 

t 

I 

c      2bc    cd-^he 

1 

2 

3 

14 

32 

d 

e   cd-^be    2de 

4 

5 

6 

32 

77 

2.  By  a  skew-symmetric  determinant,  or  matrix,  is  meant  one  in  which  aij= 
—  Gji  (and  therefore  an  =  0) . 

Establish  for  such  matrices  theorems  similar  to  Theorems  1,  2,  3  of  this  section. 

3.  By  considering  the  effect  of  changing  rows  into  columns,  prove  that  a  skew- 
symmetric  determinant  of  odd  order  is  always  zero. 

4.  Prove  that  the  rank  of  a  skew-symmetric  matrix  is  always  even. 


*The  tacit  assumption  is  here  made  that  when  k  =  r  —1,  r<in,  as  otherwise  Mk+2 
would  have  no  meaning.  The  case  r  =  n  can,  however,  obviously  not  occur  here,  for 
then  we  should  have  Mk  +  i  =  a  =j/^0. 

t  Here  again  we  assume  that  r  <  w,  for  if  r  =  n,  M,-  =  a^0. 


CHAPTEH  VI 

LINEAR  TRANSFORMATIONS   AND  THE  COMBINATION 

OF   MATRICES 

21.  Matrices  as  Complex  Quantities.  We  have  said  in  §  7  that  a 
matrix  of  m  rows  and  n  columns  is  not  a  quantity,  but  a  set  of  mn 
quantities.  This  statement  is  true  only  if  we  restrict  the  term 
quantity  to  the  real  and  complex  quantities  of  ordinary  algebra.  A 
moment's  reflection,  how^ever,  will  show  that  the  conception  of  quan- 
tity as  used  in  arithmetic  and  algebra  has  been  gradually  enlarged 
from  the  primitive  conception  of  the  positive  integer  by  using  the 
word  quantity  to  denote  entities  which,  at  an  earlier  stage,  would 
not  have  been  regarded  as  quantities  at  all,  as,  for  instance,  nega- 
tive quantities.  We  will  consider  here  only  one  of  these  extensions, 
namely  the  introduction  of  complex  quantities,  as  this  will  lead  us  to 
look  at  our  matrices  from  a  broader  point  of  view. 

If  we  have  objects  of  two  or  more  different  kinds  which  can  be 
counted  or  measured,  and  if  we  consider  aggregates  of  such  objects, 
we  get  concrete  examples  of  complex  quantities,  as,  for  instance, 
5  horses,  3  cows,  and  7  sheep.  A  convenient  way  to  write  such  a 
complex  quantity  is  (5,  3,  7),  it  being  agreed  that,  in  the  illustra- 
tion we  are  considering,  the  first  place  shall  always  indicate  horses, 
the  second  cows,  and  the  third  sheep.  In  the  abstract  theory  of 
complex  quantities  we  do  not  specify  any  concrete  objects  such  as 
horses,  cows,  etc.,  but  merely  consider  sets  of  quantities  (couples, 
triplets,  etc.),  distinguishing  these  quantities  by  the  position  they 
occupy  in  our  symbol.  Such  a  complex  quantity  we  often  find  it 
convenient  to  designate  by  a  single  letter, 


a, 


=  («,  ^,  c) 


just  as  in  ordinary  algebra  we  denote  a  fraction  (|  for  instance), 
which  really  involves  two  numbers,  by  a  single  letter.  We  speak 
here  of  the  simple  quantities  a,  6,  c  of  which  a  is  composed  as  its  first, 
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second,  third  components  ;  and  we  call  two  complex  quantities  equal 
when  and  only  when  the  components  of  one  are  equal  respectively  to 
the  corresponding  components  of  the  other.  Similarly  a  complex  quan- 
tity is  said  to  vanish  when  and  only  when  all  of  its  components  are  zero. 
What  makes  it  worth  while  to  speak  of  such  sets  of  quantities  as 
complex  quantities  is  that  it  is  found  useful  to  perform  certain  alge- 
braic operations  on  them.  By  the  sum  and  difference  of  two  complex 
quantities  a^  =  (a„b„e,),     a,  =  {a,,  b„  c,) 

we  mean  the  two  new  complex  quantities 

«i  +  «2  =  («i  +  ^2'  h  +  K  ^1  +  ^2)^  oti  —  «2  =  (%  —  ^2^  h  —  h^  ^1  —  ^2)-* 

When  it  comes  to  the  question  of  defining  what  we  shall  under- 
stand by  the  product  of  two  complex  quantities,  things  are  by  no 
means  so  simple.  It  is  necessary  here  to  lay  down  some  rule  accord- 
ing to  which,  when  two  complex  quantities  are  given,  a  third,  which 
we  call  their  product,  is  determined.  Such  rules  may  be  laid  down 
in  an  infinite  variety  of  ways,  and  each  such  rule  gives  us  a  different 
system  of  complex  quantities,  f 

We  come  now  to  the  subject  of  matrices.  A  matrix  of  m  rows 
and  n  columns  being  merely  a  set  of  mn  quantities  (which  we 
assume  to  be  either  real  quantities  or  the  ordinary  complex  quantities 
of  elementary  algebra)  arranged  in  a  definite  order,  is,  according  to 
the  point  of  view  we  have  explained,  a  complex  quantity  with  mn 
components;  and  it  is  only  a  special  application  of  the  theory  of 
complex  quantities  which  we  have  sketched,  when  we  lay  down  the 
following  definitions: 

Definition  1.    A  matrix  is  said  to  he  zero  when  and  only  when  all 

of  its  elements  are  zero. 

Definition  2.  Two  matrices  are  said  to  he  equal  when  and  only 
when  they  have  the  same  numher  of  rows  and  of  columns^  and  every 
element  of  one  is  equal  to  the  corresponding  element  of  the  other. 

*  That  this  is  the  natural  meaning  to  be  attached  to  the  terms  sum  and  difference 
will  be  seen  by  reference  to  the  concrete  illustration  given  above. 

t  If,  in  particular,  v^e  wish  to  introduce  the  ordinary  system  of  complex  quanti- 
ties of  elementary  algebra,  we  use  a  system  of  couples,  and  define  the  product  of  two 
couples,  cLi  =  (at,  61),     a^  =  (^2,  ^2), 

by  the  formula  Uia^  =  {aia^  —  h^hiy  a\b2.  -I-  azh-C). 

For  further  details  of.  Burkhardt's  Finiktionentheorie,  §§  2,  3. 
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Definition  3.  By  the  sum  {or  difference)  of  two  matrices  of  m 
rows  and  n  columns  each^  we  understand  a  matrix  of  m  rows  and  n  col- 
umns^ each  of  whose  elements  is  the  sum  (or  difference)  of  the  corre- 
sponding elements  of  the  given  matrices. 

In  order  to  distinguish  them  from  matrices,  we  will  call  the 
ordinary  quantities  of  algebra  (real  quantities  and  ordinary  complex 
quantities)  scalars. 

Before  proceeding,  as  we  shall  do  in  the  next  section,  to  the 
definition  of  the  product  of  two  matrices,  we  will  define  the  product 
of  a  matrix  and  a  scalar. 

Definition  4.  If  a.  is  a  matrix  *  and  k  a  scalar,  then  hy  the  prod- 
uct ksi  or  ak  we  understand  the  matrix  each  of  whose  elements  is  k 
times  the  corresponding  element  of  a. 

As  an  obvious  consequence  of  our  definitions  we  state  the 
theorem :   . 

Theorem.  All  the  laws  of  ordinary  algebra  hold  for  the  addition 
or  subtraction  of  matrices  and  their  multiplication  by  scalars. 

For  instance,  if  a,  b,  c  are  matrices,  and  k^  I  scalars, 

a  +  b  =  b  4-  a, 

a+(b  +  c)  =  (a  +  b)  +  c, 
kdi-\-kh  =  k{di  4-  b), 
kdi  +  ldi  =  (k-\-l)ei,-\ 

EXERCISE 

If  r\  and  r^  are  the  ranks  of  two  matrices  and  R  the  rank  of  their  sum,  prove 
that  ,^  ^ 

22.  The  Multiplication  of  Matrices.  Up  to  this  point  we  have  con- 
sidered matrices  with  m  rows  and  n  columns.  For  the  sake  of  sim- 
plicity of  statement,  we  shall  confine  our  attention  from  now  on  to 
square  matrices,  that  is  to  the  case  m  =  n.    This  involves  no  real  loss 

*  The  notation  here  used,  matrices  being  denoted  by  heavy-faced  type,  will  be 
systematically  followed  in  this  book. 

t  We  add  that,  as  a  matter  of  notation,  we  shall  write 

(_l)a=-a. 
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of  generality  provided  we  agree  to  consider  a  matrix  of  m  rows  and 
n  columns,  where  m^^n,  as  equivalent  to  a  square  matrix  of  order 
equal  to  the  larger  of  the  two  integers  m,  n  and  obtained  from  the 
given  matrix  by  filling  in  the  lacking  rows  or  columns  with  zeros. 

The  question  now  presents  itself:  How  shall  we  define  the  prod- 
uct of  two  square  matrices  of  the  same  order  ?  It  must  be  clearly 
understood  that  we  are  logically  free  to  lay  down  here  such  definition 
as  we  please,  and  that  the  definition  we  select  is  preferable  to  others 
not  on  any  a  priori  grounds,  but  only  because  it  turns  out  to  be  more 
useful.  We  select  the  following  definition,  which  is  suggested  *  by 
the  multiplication  theorem  for  determinants : 

Definition  1.  The  product  ab  of  two  square  matrices  of  the  nth 
order  is  a  square  matrix  of  the  nth  order  in  which  the  element  which  lies 
in  the  ith  row  and  jth  column  is  obtained  by  multip)lying  each  element  of 
the  ith  row  of  a  by  the  corresponding  element  of  the  jth  column  of  b  and 
adding  the  results. 

Let  us  denote  by  a^y  and  h^j  the  elements  in  the  ^th  row  and  yth 
column  of  a  and  'b  respectively,  or,  as  we  will  say  for  brevity,  the 
element  (^,  j)  of  these  matrices.  Then,  according  to  our  definition, 
the  element  (i,  j)  of  the  product  ab  is 

(1)  a  i^h^  +  a^2^2y  +  •  •  •  +  «iAy^ 

while  the  element  (z,  j)  in  the  matrix  ba  is 

(2)  a^jbi^  +  a^jb^^  +  •  •  •  +  a^jbin- 

Since  the  two  quantities  (1)  and  (2)  are  not  in  general  equal,  we 
obtain 

Theorem  ,1.  The  multiplication  of  matrices  is  not  in  general  com- 
mutative^ that  is,  in  general  l    ,  t^ 

Let  us  now  consider  a  third  matrix  c  whose  element  (^,  j)  is  c^-j, 
and  form  the  product  (ab)c.     The  element  (^,  j)  of  this  matrix  is 

.    (3)  +     

*  Historically  this  definition  was  suggested  to  Cayley  by  the  consideration  of  the 
composition  of  linear  transformations;  cf.  §  23. 
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On  the  other  hand,  the  element  (^,  j)  of  the  matrix  a(bc)  is 

(4)  +  «t2(^21^1i  +  ^22%  +   •  •  •   +  h^^C^j) 


+ 


Since  the  two  quantities  (3)  and  (4)  are  equal,  we  have  established 
Theorem  2.      The  multiplication  of  matrices  is  associative^  that  is, 

(ab)c  =  a(bc). 

Finally,  since  the  element  (^,  i)  of  the  matrix  a  (b  +  c)  is  clearly 
equal  to  the  sum  of  the  elements  (^,  j)  of  the  matrices  ab  and  ac,  we 
have  the  result 

Theorem  3.      The  multiplication  of  matrices  is  distributive,  that  is, 

a(b  +  c)  =  ab  +  ac. 

Besides  the  commutative,  associative,  and  distributive  laws,  there 
is  one  other  principle  of  elementary  algebra  which  is  of  constant  use, 
namely,  the  principle  that  a  product  cannot  vanish  unless  at  least  one 
of  the  factors  is  zero.  Simple  examples  show  that  this  is  not  true  in 
the  algebra  of  matrices.     We  have,  for  instance. 


(5) 


a 


11 


a 


12 


a 


21 


a. 


22 


a 


31 


a 


32 


31 


32 


33 


=  0, 


whatever  the  values  of  the  a's  and  Fs  may  be.     Hence 

Theorem  4.  From  the  vanishing  of  the  product  of  two  or  more 
matrices,  we  cannot  infer  that  one  of  the  factors  is  zero. 

The  process  of  cancelling  out  non-vanishing  factors  which  enter 
throughout  an  equation  will,  therefore,  be  inadmissible  in  the  algebra 
of  matrices. 

We  next  state  a  result  which  follows  at  once  from  the  similarity 
between  the  theorem  for  the  multiplication  of  determinants  and  our 
definition  of  the  product  of  two  matrices : 

Theorem  5.  The  determinant  of  a  matrix  which  is  obtained  hy 
midtiplying  together  two  or  more  matrices  is  equal  to  the  product  of  the 
determinants  of  these  matrices. 
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The  conception  of  the  conjugate  of  a  matrix,  as  defined  in  §  7, 
Definition  2,  is  an  important  one,  and  the  following  theorem  concern- 
ing it  is  often  useful: 

Theorem  6.  The  conjugate  of  the  product  of  any  number  of 
matrices  is  the  product  of  their  conjugates  taken  in  the  reverse  order. 

In  order  to  prove  this  theorem  we  first  notice  that  its  truth  in  the 
case  of  two  matrices  follows  at  once  from  the  definition  of  the  prod- 
uct of  two  matrices.  Its  truth  will  therefore  follow  in  all  cases  if, 
assuming  the  theorem  to  be  true  for  the  product  of  ^  —  1  matrices, 
we  can  prove  that  it  is  true  for  the  product  of  n  matrices.  Let  us 
write 

Then,  from  what  we  have  assumed, 

D  =  Q.fi  •  •  •  3-3  0.2', 

where  we  use  accents  to  denote  conjugates.     Accordingly, 
(aia2  •  •  •  SLn)'  =  (aib)'  =  b'aj  =  a^  •  •  •  a^  ai, 

and  our  theorem  is  proved. 

In  conclusion  we  lay  down  the  following : 

Definition  2.  A  square  matrix  is  said  to  he  singular  if  its  deter- 
minant is  zero. 

According  to  the  convention  made  at  the  beginning  of  this  sec- 
tion, it  will  be  seen  that  all  matrices  which  are  not  square  are 
singular. 

EXERCISES 

1.  Definition.  A  matrix  a  is  called  a  divisor  of  zero  if  a  matrix  b  different 
from  zero  exists  such  that  either  ab  =  0  or  ba  =  0. 

Prove  that  every  matrix  one  of  whose  rows  or  columns  is  composed  wholly  of 
zeros  is  a  divisor  of  zero. 

2.  If  it  is  possible  to  pass  from  a  to  b  by  means  of  an  elementary  transforma- 
tion (cf.  §  19,  Definition  1),  prove  that  there  either  exists  a  non-singular  matrix  c 
such  that 

ac  =  b, 

or  a  non-singular  matrix  d  such  that 

da  =  b. 

p 
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3.  If  all  the  elements  of  a  matrix  are  real,  and  if  the  product  of  this  matrix 
and  its  conjugate  is  zero,  prove  that  the  matrix  itself  is  zero. 

4.  If  the  corresponding  elements  of  two  matrices  a  and  b  are  conjugate  imagi- 
naries,  and,  b'  being  the  matrix  conjugate  to  b,  if 

ab'  =  0,  then  a  =  b  =  0. 


23.  Linear  Transformation.  Before  going  farther  with  the 
theory  of  matrices  we  will  take  up,  in  this  section  and  the  next,  the 
closely  allied  subject  of  linear  transformation,  which  may  be  regarded 
as  one  of  the  most  important  applications  of  the  theory  of  matrices. 

In  algebra  and  analysis  we  frequently  have  occasion  to  introduce, 
in  place  of  the  unknowns,  or  variables,  we  had  originally  to  deal 
with,  certain  functions  of  these  quantities  which  we  regard  as  new 
unknowns  or  variables.  Such  a  transformation,  or  change  of  vari- 
ables, is  particularly  simple,  and  for  many  purposes  particularly 
important,  if  the  functions  in  question  are  homogeneous  linear  poly- 
nomials. It  is  then  called  a  homogeneous  linear  transformation,  or, 
as  we  shall  say  for  brevity,  simply  a  linear  transformation.  If  x^^  •  •  •  x^ 
are  the  original  variables,  and  x[.,  •  •  •  x'^  the  new  ones,  we  have,  as  the 
formulae  for  the  transformation. 


J  _ 


X-^  —  <^22  ^■^-T'   "  '  ~r  <^in*^! 


n") 


^  ^n  —  ^nV^l  H~   •  ■  •  T*  (^nnP^n* 


The  square  matrix 


a  = 


a 


11 


a 


m 


a 


nl 


a. 


is  called  the  matrix  of  the  transformation,  and  the  determinant  of 
this  matrix,  which  we  will  represent  by  a,  is  called  the  determinant 
of  the  transformation.  _Inasmuch  as  the  transformation  is  com- 
pletely determined  by  its  matrix,  no  confusion  will  arise  if  we  speak 
of  the  transformation  a. 

In  most  cases  where  we  have  occasion  to  use  a  transformation  it  is 
important  for  us  to  be  able,  in  the  course  of  our  work,  to  pass  back  to 
the  original  variables,  and  for  this  purpose  it  must  be  possible,  not 
merely  to  express  x[,  •••  xl  as  functions  of  x^^  •••  a;„,  but  also  to  express 
a^j,  •••  Xn  as  functions  of  rrj,  •••  x'^.     In  the  case  of  linear  transforma- 
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tions  this  can  in  general  be  done.  For  the  equations  of  the  transfor- 
mation may  be  regarded  as  non-homogeneous  linear  equations  in 
x^,  '■'  Xn'>  and  if  the  determinant  a  of  the  transformation  is  not  zero, 
they  can  be  solved  and  give 

a  a 


X, 


a 


A 


X 


a 


nf 


where  J.^^,  •  •  •  A^n  are  the  cof actors  of  a^^^  •  •  •  a^n  in  a. 

This  transformation  A  is  called  the  inverse  of  the  transformation 
a,  but  it  must  be  remembered  that  it  exists  only  \i  a^O.  A  linear 
transformation  for  which  a  =  0  is  called  a  singular  transformation. 
If  a  is  non-singular,  its  inverse  A  is  also  non-singular,  since  the  deter- 
minant of  A  is  a~^(cf.  Corollary  2,  §  11). 

Definition.     The  special  linear  transformation 


whose  matrix  is 


1     0 
0     1 


1  = 


0 
0 


X. 


ni 


0    0    .-.    1 

is  called  the  identical  transformation. 

The  determinant  of  this  transformation  is  1. 

We  turn  now  to  the  subject  of  the  composition  of  linear  trans- 
formations. If  we  introduce  a  new  set  of  variables  x'  as  functions 
of  the  original  variables  x^  and  then  make  a  second  transformation 
by  introducing  a  third  set  of  variables  x^'  as  functions  of  the  vari- 
ables x\  these  two  transformations  can  obviously  be  combined  and 
the  variables  x^^  expressed  directly  in  terms  of  the  a;'s.  If  the  two 
transformations  which  we  combine  are  linear  transformations,  it  is 
readily  seen  that  the  resulting  transformation  will  also  be  linear. 
The  precise  formulae  are  important  here,  and  for  the  sake  of  simplic- 
ity we  will  write  them  in  the  case  of  three  variables,  a  case  which 
will  be  seen  to  be  perfectly  typical  of  the  general  case. 
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a^g  =  O^^Xj^  +  ^32^2  "'■  ^33-^3' 

Replacing   the  x'^s  in  b  by  their 


Let 

X-^  =  (^ii^i    1    ^i2'^2    '    ^13*^3' 
•^2  ^^  ^21*^1    '    ^22*^2    '    ^23*^3' 

be  two  linear   transformations, 
values  from  a,  we  get 

f  <  =  (^11^1  +  «21^2  +  «31^3)^1 
I    V^12    11    '    ^22    12    '    ^32^13/'^2 
+  {Cli^O^-^  +  <^23^12  +  ^33^13)^3' 

X^  =  (<^ii^21  "^  ^21^22  "l~  ^31^23/^1 
+  (<^i2^21  ~^~  ^22^22  "^  ^32^23/^2 
+  (^i3C>2i  +  <^23^22  "^  ^33^23/^3' 

^3  ~  (^11^31  +  ^21^32  +  ^31^33)^1 
4-  (^12^31  "^  ^22^32  "^  ^32^33/^2 
I         +  (^13^31  "^  ^23^32  "^  ^33^33)^3* 

It  will  be  seen  that  the  matrix  of  this  transformation  is  ba. 
Hence, 

Theorem.  If  we  pass  from  the  variables  x  to  the  variables  x^  by  a 
linear  transformation  of  matrix  a,  and  from  the  variables  x'  to  the  vari- 
ables x^^  by  another  linear  transformation  of  matrix  b,  then  the  linear 
transformation  of  matrix  ba  will  carry  us  directly  from  the  variables  x 
to  the  variables  x"  ,^ 

24.  CoUineation.  We  come  now  to  an  important  geometrical 
application  of  the  subject  of  linear  transformation.  For  the  sake  of 
simplicity  we  begin  with  the  case  of  three  variables,  which  we  will 
regard  as  the  homogeneous  coordinates  of  points  in  a  plane. 

The  equations  .  ^,  ^  ^^^  ^  ^^^  ^  ^^^^ 

(1)  '   y'  =  a^x-^b^y^-cj^, 

.  t'  =  a^x  4-  b^y  +  c^t 

*  This  result  may  be  remembered  conveniently  by  means  of  the  following  symbolic 
notation,  which  is  often  convenient.  Let  us  denote  the  transformation  a  by  the 
symbolic  equation  x'  =  a(a;),  and  the  transformation  b  by  x"  =  h(x').  The  result  of 
combining  these  two  transformations  is  then  x"  =  b(a(x))  or  simply  x"  =  ba(x). 
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may  be  regarded  as  defining  a  transformation  of  the  points  of  the 
plane ;  that  is,  if  (a;,  y,  t)  is  an  arbitrarily  given  point,  we  can  com- 
pute, by  means  of  (1),  the  coordinates  (x\  y\  t^)  of  a  second  point 
into  which  we  regard  the  first  point  as  being  transformed.  The 
only  exception  is  when  the  computed  values  of  x\  y\  t'  are  all  three 
zero,  in  which  case  there  is  no  point  into  which  the  given  point  is 
transformed.  This  exceptional  case  can  clearly  occur  only  when  the 
determinant  of  the  transformation  (1)  is  zero.  Let  us  then  confine 
our  attention  to  non-singular  linear  transformations.  In  this  case, 
not  only  does  every  point  (x^  y,  t)  correspond  to  a  definite  point 
{x\  y\  t%  but  conversely,  every  point  {x\  y\  t')  corresponds  to  a 
definite  point  (x^  y^  t),  since  the  transformation  (1)  now  has  an  inverse 

x=~lx'+^-y^  +  ~H\ 
D  D^       D 


(2) 


where  D  is  the  determinant  of  (1),  and  A^^  B^^  0^  are  the  cof actors 
ini>. 

The  points  (rr,  y,  t)  of  the  line 

(3)  ax  +  ^y-{-yt  =0 

are  transformed  by  means  of  the  non-singular  transformation  (1) 
into  points  of  another  line, 

as  we  see  by  using  formulse  (2).     Conversely  every  point  of  the  line 

(4)  corresponds,  as  we  see  by  using  (1),  to  a  point  on  (3).  That  is, 
the  transformation  establishes  a  one-to-one  correspondence  between 
the  points  on  the  two  lines  (3)  and  (4),  or,  as  we  say,  it  transforms 
the  line  (3)  into  the  line  (4).  On  account  of  this  property  of  trans- 
forming straight  lines  into  straight  lines,  the  transformation  is  called 
a  collineation.  The  transformation  is  also  known  as  a  projective 
transformation^  for  it  may  be  shown  that  it  can  be  effected  by  pro- 
jecting one  plane  on  to  another  by  means  of  straight  lines  radiating 
from  a  point  in  space. 
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What  we  have  here  said  in  the  case  of  two  dimensions  applies 
with  no  essential  change  to  three  dimensions.     The  transformation 


(5) 


X 


f  _ 


7/'  =  a^x  +  b^i/  +  c^z  +  dj., 
z'  =  a^x  +  Jgj/  +  c^z  +  c?3^, 
t^  —  a^x  -\-b^  +  c^z  +  d^ 


gives  us,  provided  its  determinant  is  not  zero,  a  one-to-one  trans- 
formation of  the  points  of  space,  which  carries  over  planes  into 
planes,  and  therefore  also  straight  lines  into  straight  lines,  and  is 
called  a  collineation  or  projective  transformation  of  space.  The  same 
idea  can  be  extended  to  spaces  of  higher  dimensions. 

Quite  as  important  is  the  case  of  one  dimension.     The  transfor- 


mation 
(6) 


x'  =  a-^x  +  5j^, 
t'  =  a^x  +  bjt 


gives  us,  provided  its  determinant  is  not  zero,  a  one-to-one  trans- 
formation of  the  points  on  a  line.  This  we  call  a  projective  trans- 
formation of  the  line,  the  term  collineation  being  in  this  case 
obviously  inadequate. 

It  is  possible,  although  for  most  purposes  not  desirable,  to  express 
the  projective  transformations  (6),  (1),  (5)  in  one,  two,  and  three 
dimensions  in  terms  of  non-homogeneous,  instead  of  homogeneous 
coordinates.     We  thus  get  the  formulae 


(7) 


(8) 


^,  _a^X-\-b^ 
a^X-\-  b^^ 

a^X -\-  b^Y -{-  c^ 
a^X -{-b^Y-\-  c^ 


(9) 


7  _  ^2 


aoX+boY-\-c; 


j^f  ^  a^X-^-b^Y-^c^Z+d^ 
~  a^X+b^Y+c^Z^d^ 

a^X+b^Y-\-c^Z+d^ 
a^X-^b^Y+c^Z-fTl^ 

a^X  +  b^Y -\-  c^Z  +  6?3 
a.X+b,Y+c.Z+d' 


Y'  = 


Z'  = 


These  fractional  forms  may,  in  particular,  be  used  to  advantage 
in  case  their  denominators  reduce  to  mere  constants.  This  special 
case,  which  is  known  as  an  affine  transformation^  may  clearly  be  char- 
acterized  by  saying   that  all   finite  points  go  into  finite  points.* 

*  If  we  consider  the  still  more  special  case  in  which  the  constant  terms  in  the 
numerators  of  (8)  and  (f>)  are  zero,  that  is,  affine  transformations  in  which  the  orifrin 
is  transformed  into  itself,  we  see  that  our  formulas  (8)  and  (9)  have  the  form  (6)  and 
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These  affine  transformations  are  of  much  importance  in  mechanics, 
where  they  are  known  as  homogeneous  strains;  cf.,  for  instance, 
Webster's  Dynamics  (Leipzig,  Teubner),  pp.  427-444. 

Although  we  propose  to  leave  the  detailed  discussion  of  singular 
transformations  to  the  reader  (see  Exercise  1  at  the  end  of  this  sec- 
tion), we  will  give  one  theorem  concerning  them. 

Theorem  1.  If  the  points  P^,  P^,  •••  are  carried  over  hy  a  singu- 
lar projective  transformation  into  the  points  P'^^  P'^^  •••,  then^  if  our 
transformation  is  in  one  dimension^  the  points  P'  will  all  coincide; 
if  in  two  dimensions^  they  ivill  all  be  collinear ;  if  in  three  dimensions^ 
they  will  all  he  complanar^  etc. 

Suppose,  for  instance,  that  we  have  to  deal  with  two  dimensions. 
Since  the  determinant  of  the  collineation  (1)  is  supposed  to  be  zero, 
the  three  polynomials  in  the  second  members  of  (1)  are  linearly  de- 
pendent ;  that  is,  there  exist  three  constants,  ky^  k^<^  ^3,  not  all  zero, 
and  such  that  for  all  values  of  x,  y^  t, 

(10)  k^J  +  k^y'  +  k^t^  =  0. 

Accordingly  all  points  {x\  y\  t')  obtained  by  this  transformation 
lie  on  the  line  (10). 

Similar  proofs  apply  to  the  cases  of  one  dimension  and  of  three 
or  more  dimensions. 

Theorem  2.  Any  three  distinct  points  on  a  line  may  he  carried 
over  respectively  info  any  three  distinct  points  on  the  line  hy  one^  and 
only  one^  projective  transformation. 

Let  the  three  initial  points  be  P^  P^,  Pg,  with  homogeneous  coor- 
dinates {xy,  ^j),  (0^2,  ^2)'  (%  ^3)  respectively,  and  let  the  points  into 
which  we  wish  them  transformed  be  Pj,  Pg,  P3  with  coordinates 
{x\^  ^J),  (0^2,  ^2)'  (^3'  ^3')*     '^^^  projective  transformation 

x^  =  ax  -\-  ^t, 

t'  =  ryX-\-    St 

(1)  respectively.  Thus  (6)  may  be  regarded  either  as  the  general  projective  transfor- 
mation of  a  line  (if  x,  t  are  regarded  as  homogeneous  coordinates)  or  as  a  special 
afiBne  transformation  of  the  plane  (if  x,  t  are  regarded  as  non-homogeneous  coordi- 
nates). Similarly  (1)  may  be  regarded  either  as  the  general  projective  transformation 
of  a  plane,  or  as  a  special  aflfine  transformation  of  space. 
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carries  over  any  given  point  (a;,  t)  into  a  point  (x\  t')  whose  position 
depends  on  the  values  of  the  constants  a,  yS,  7,  3.  Our  theorem  is 
true  if  it  is  possible  to  find  one,  and,  except  for  a  constant  factor 
which  may  be  introduced  throughout,  only  one,  set  of  seven  con- 
stants—  four,  a,  yS,  7,  8,  and  three  others,  /o^,  p^i  p^,  none  of  which  is 
zero  —  which  satisfy  the  six  equations 


Pix[  =  ax^  4-  /3^i, 


p^x'^=ax^-^/3t^, 


9z^z  =  7^3  +  ^^3- 


Since  the  a;'s  and  ^'s  are  all  known,  we  have  here  six  homogeneous 
linear  equations  in  seven  unknowns.  Hence  there  are  always  solu- 
tions other  than  zeros,  the  number  of  independent  ones  depending  on 
the  rank  of  the  matrix  of  the  coefficients.  Transposing  and  rear- 
ranging the  equations,  we  have 


x^a  +  t^l^ 


x^a  4-  ^o/3 


XoOt  +  to0 


^i7  +  ^1^  -  APi 


2^27  +  ^2^ 


^37  +  ^3^ 


=  0, 
=  0, 

-  4/^3  =  0, 

-  4/)3  =  0. 


The  matrix  of  these  equations  is  of  rank  six.     For  consider  the 
determinant  of  the  first  six  columns  with  its  sign  reversed, 


D  = 


x^ 

X(\ 

0 
0 


X. 


x^ 


0        0 


0       0 


x. 


x\ 
0 

0 
0 
0 


0 

x\ 
0 

^; 
0 

0 


Since  P^,   Pg'   ^3   ^^®    distinct,  there   exist   two   constants  <?j,   c^^ 
neither  of  which  is  zero,  such  that 


11  """     22  "■    *^3  ~~'      ' 

Hh  -^Hh  4-^3  =0. 
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Hence,  adding  to  the  fifth  row  of  D  c^  times  the  first  row  and  c^  times 
the  second,  and  to  the  sixth  row  c^  times  the  third  row  and  Cg  times 
the  fourth,  we  have 


D  = 


x^ 

0 

0 

0 

0 


0 
0 

0 
0 


0 
0 

0 

0 


x\ 
0 


^1^1 


0 


^2 

0 

(?2^2 


^1^1      ^2^2 


(?j(?2 


*^2    2 


and  this  is  not  zero,  since  P\  and  P^  are  distinct  as  well  as  P^ 

and  Pg- 

In  the  same  way  we  see  that  the  determinants  obtained  by 
striking  out  the  sixth  and  the  fifth  columns  respectively  of  the 
matrix  are  not  zero.  Accordingly,  by  Theorem  4,  §  17,  we  see  that 
the  equations  have  a  solution  in  which  none  of  the  quantities 
Pv  9v  Ps  ^^^  zero,  and  that  every  solution  is  proportional  to  this 
one.  All  these  solutions  obviously  yield  the  same  projective  trans- 
formation of  the  line. 

Corollary.     The  transformation  just  determined  is  non-singular. 

This  follows,  by  a  reference  to  Theorem  1,  from  the  fact  that  it 
does  not  carry  P^,  Pg,  Pg  into  a  single  point. 


EXERCISES 

1.  Discuss  singular  projective  transformations  in  one,  two,  and  three  dimen- 
sions ;  noting,  in  particular,  the  effect  of  the  rank  of  the  matrix  of  the  transfor- 
mation, first,  on  the  distribution  of  the  points  which  have  no  corresponding  points 
after  the  transformation,  and  secondly,  on  the  distribution  of  the  points  into  which 
no  points  are  carried  over  by  the  transformation. 

2.  Prove  that  any  four  complanar  points  no  three  of  which  are  collinear  may 
be  carried  over  into  any  four  points  in  the  plane,  no  three  of  which  are  collinear, 
by  one  and  only  one  collineation. 

3.  State  and  prove  the  corresponding  theorem  in  n  dimensions. 

4.  Prove  that  the  transformation  from  a  first  system  of  homogeneous  coordi- 
nates to  a  second  is  effected  by  a  non-singular  linear  transformation.  Consider 
the  case  of  one,  two,  and  three  dimensions. 
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5.  Prove  that  a  projective  transformation  in  space  effects  on  every  plane  a 
two-dimensional,  and  on  every  line  a  one-dimensional,  projective  transformation, 
while  at  the  same  time  the  positions  of  the  plane  and  line  are  changed. 

[Suggestion.  If  p  and  pf  are  any  two  corresponding  planes,  assume  in  any  way  a 
pair  of  perpendicular  axes  in  each  of  them,  and  denote  by  {x^,  y^,  f^),  and  (xj,  ?/{,  i[) 
respectively  the  systems  of  two-dimensional  homogeneous  coordinates  based  on  these 
axes.  Then  show,  by  using  the  result  of  Exercise  4,  that  the  transformation  of  one 
plane  on  the  other  will  be  expressed  by  writing  x{,  y[^  t[  as  homogeneous  linear  poly- 
nomials in  X, ,  ?/j,  ty  ] 

25.  Further  Development  of  the  Algebra  of  Matrices.  We  proceed 
to  establish  certain  further  properties  of  matrices,  leaving,  however, 
much  to  the  reader  in  the  shape  of  exercises  at  the  end  of  the  section. 

The  theory  of  linear  transformations  suggests  to  us  at  once  certain 
properties  of  matrices.     The  first  of  these  is  : 

Theorem  1.     The  matrix 

1     0    0 

0    1  ......0 


1  = 


0     0 


has  the  property  that  if  a  is  any  matrix  whatever 

la  =  al  =  a. 

For  the  linear  transformation  of  which  a  is  the  matrix  will  evi- 
dently not  be  changed  by  being  either  followed  or  preceded  by  the 
identical  transformation  of  which  I  is  the  matrix. 

If  we  do  not  wish  to  use  the  idea  of  linear  transformation,  we  may 
prove  the  theorem  directly  by  actually  forming  the  products  la  and  al. 

This  theorem  tells  us  that  I  plays  in  the  algebra  of  matrices  the 
same  role  that  is  played  by  1  in  ordinary  algebra.  For  this 
reason  I  is  sometimes  called  the  unit  matrix  or  idemf actor. 

Let  us  now  consider  any  non-singular  linear  transformation  and 
its  inverse.  These  two  transformations  performed  in  succession  in 
either  order  obviously  lead  to  the  identical  transformation.  This 
gives  us  the  theorem  : 

Theorem  2.     If  a^^-'-a 


a  = 


11 


m 


a 


m 


a. 
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is  a  non-singular  matrix  of  determinant  a,  and  if  A^j  denote  in  the  ordi- 
nary way  the  cof  actors  of  the  elements  of  a,  the  matrix 


A 


11 


A 


m 


a 


Ap 
a 


a 


a 


called  the  inverse  of  a,  and  denoted  by  Si~\  is  a  non-singular  matrix 
which  has  the  property  that 

aa*i  =  a~ia  =  I. 

This  suggests  that  we  define  positive  and  negative  integral 
powers  of  matrices  as  follows  : 

Definition  1.  If  p  is  any  positive  integer  and  a  any  matrix  we 
understand  by  3F  the  product  aa  •  •  •  a  to  p  factors.  If  di  is  a  nonr 
si^igular  matrix.,  we  define  ^its  negative  and  zero  powers  by  the  for- 
mulce 

ei-p  =  (a-iy,     a«  =  I. 

From  this  definition  we  infer  at  once 
Theorem  3.     The  laws  of  indices 


pq 


a^a«  =  a^+S    {aPy  =  a 

hold  for  all  matrices  when  the  indices  p  and  q  are  positive  integers.,  and 
for  all  non-singular  matrices  when  p  and  q  are  any  integers. 

We  turn  now  to  the  question  of  the  division  of  one  matrix  by 
another.  We  naturally  define  division  as  the  inverse  of  multiplica- 
tion, and,  since  multiplication  is  not  commutative,  we  thus  get  two 
distinct  kinds  of  division  ;  a  divided  by  b  being  on  the  one  hand  a 

matrix  x  such  that  ^      , 

a=  Dx, 

on  the  other  hand  a  matrix  y  such  that 

a  =  yb. 

On  account  of  this  ambiguity,  the  term  division  is  not  ordinarily  used 
here.     We  have,  however,  as  is  easily  seen,  the  following  theorem  : 
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Theorem  4.     If  a.  is  any  matrix  and  b  any  non- singular  matrix^ 
there  exists  one,  and  only  one,  matrix  x  which  satisfies  the  equation 

a  =  bx, 

and  one,  and  only  one,  matrix  y  which  satisfies  the  equation 

a  =  yb, 

and  these  matrices  are  given  respectively  by  the  formuloe 

X  =  b"^a,        y  =  ab~^ 

A  special  class  of  matrices  is  of  some  importance  ;  namely,  those 
of  the  type 


h    0     . 

.     0 

0    k    . 

.     0 

•            • 

0     0     • 

.   h 

Such  matrices  we  will  call  scalar  matrices  for  a  reason  which  will 
presently  appear. 

If  we  denote  by  k  the  scalar  matrix  just  written,  and  by  a  any 
matrix  of  the  same  order  as  k,  we  obtain  readily  the  formula 

(1)  ka  =  ak  =  ha.. 

If  now,  besides  the  scalar  matrix  k,  we  have  a  second  scalar  matrix 
1  in  which  each  element  in  the  principal  diagonal  is  I,  we  have  the 
two  formulae 

(2)  k  +  l  =  l  +  k  =  (^  +  OI, 

(3)  kl  =  lk  =  HL 

Formula  (1)  shows  that  scalar  matrices  may  be  replaced  by  ordinary 
scalars  when  they  are  to  be  multiplied  by  other  matrices  ;  while 
formulse  (2)  and  (3)  show  that  scalar  matrices  combined  with  one 
another  not  only  obey  all  the  laws  of  ordinary  scalars,  but  that  each 
scalar  matrix  may  in  such  cases  be  replaced  by  the  scalar  which 
occurs  in  each  element  of  its  principal  diagonal  provided  that  at  the 
end  of  the  work  the  resulting  scalar  be  replaced  by  the  correspond' 
ing  scalar  matrix. 
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For  these  reasons  we  may,  in  the  algebra  of  matrices,  replace  all 
scalar  matrices  by  the  corresponding  scalars,  and  then  consider  that 
all  scalars  which  enter  into  our  work  stand  for  the  corresponding 
scalar  matrices.  If  we  do  this,  the  unit  matrix  I  will  be  represented 
by  the  symbol  1. 

Definition  2.  By  the  adjoint  K  of  a  matrix  a  is  uiider stood 
another  matrix  of  the  same  order  in  which  the  element  in  the  ith  row 
and  jth  column  is  the  cof actor  of  the  element  in  the  jth  row  and  ith 
column  of  a.* 

It  will  be  seen  that  when  a  is  non-singular, 

(4)  K  =  adC-\ 

but  it  should  be  noticed  that  while  every  matrix  has  an  adjoint,  only 
non-singular  matrices  have  inverses. 

Equation  (4)  may  be  written  in  the  form 

(5)  Aa  =  aA  =  aI, 

a  form  in  which  it  is  true  net  merely  when  a  is  non-singular,  but  also, 
as  is  seen  by  direct  multiplication,  when  the  determinant  of  a  is  zero. 
Finally  we  come  to  a  few  important  theorems  concerning  the 
rank  of  the  matrix  obtained  by  multiplying  together  two  given 
matrices.  In  the  first  place,  we  notice  that  the  rank  of  the  product 
is  not  always  completely  determined  by  the  ranks  of  the  factors. 
This  may  be  shown  by  numerous  examples,  for  instance,  in  formula 
(5),  §  22,  the  ranks  of  the  factors  are  in  general  two  and  one,  and  the 
rank  of  the  product  is  zero,  while  in  the  formula 


a 


11 


a 


12 


a 


21 


^31 


a 


22 


a 


32 


0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

a 


12 


a 


22 


a 


32 


the  ranks  of  the  factors  are  in  general  the  same,  namely  two  and 
one,  while  the  rank  of  the  product  is  one. 

But  though,  as  this  example  shows,  the  ranks  of  the  factors  (even 
together  with  the  order  of  the  matrices)  do  not  suffice  to  determine 
the  rank  of  the  product,  there  are,  nevertheless,  important  inequali- 
ties between  these  ranks,  one  of  which  we  now  proceed  to  deduce. 

*  Notice  the  interchange  of  rows  and  columns  here,  which  in  the  case  of  adjoint 
determinants,  being  immaterial  and  sometimes  inconvenient,  was  not  made. 
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For  this  purpose  consider  the  two  matrices 


a  = 


^11   *■■  ^In 


^n\   '"  ^nn 


b  = 


^1   •••  hn 


^m  '"  ^7 


and  their  product  ab. 


Theorem  5.  Any  k-rowed  determmant  of  the  matrix  ab  is  equal 
to  an  aggregate  of  k-rowed  determinants  of  b  each  multiplied  into  a 
polynomial  in  the  a's^  and  also  to  an  aggregate  of  k-rowed  determinants 
of  Si  each  multiplied  by  a  polynomial  in  the  b^s. 

For  any  A;-rowed  determinant  of  ab  may  be  broken  up  into  a  sum 
of  determinants  of  the  ^fch  order  in  such  a  way  that  each  column  of 
each  determinant  has  one  of  the  ^'s  as  a  common  factor.*  After 
taking  out  these  common  factors  from  each  determinant,  we  have 
left  a  determinant  in  the  a's  which,  if  it  does  not  vanish  identically,  is 
a  ^- rowed  determinant  of  a.  Or,  on  the  other  hand,  we  may  break 
up  the  A^-rowed  determinant  of  ab  into  a  sum  of  determinants  of  the 
^th  order  in  such  a  way  that  each  row  of  each  determinant  has  one 
of  the  a's  as  a  common  factor.  After  taking  out  these  common  factors 
from  each  determinant,  we  have  left  a  determinant  in  the  5's  which, 
if  it  does  not  vanish  identically,  is  a  A:-rowed  determinant  of  b. 

From  the  theorem  just  proved  it  is  clear  that  if  all  the  y^-rowed 
determinants  of  a  or  of  b  are  zero,  the  same  will  be  true  of  all  the 
Z;-rowed  determinants  of  ab.     Hence 

Theorem  6.  The  rank  of  the  product  of  two  matrices  cannot 
exceed  the  rank  of  either  f actor, -f 


*  The  truth  of  this  statement  and  the  following  will  be  evident  if  the  reader 
actually  writes  out  the  matrix  ab. 

t  Thus  if  vi  and  r2  are  the  ranks  of  the  two  factors  and  B  is  the  rank  of  the  prod- 
uct, we  have  i?"^ri,  i?<r2.  This  is  one  half  of  Sylvester's  "Law  of  Nullity,"  of 
which  the  other  half  may  be  stated  in  the  form  B^ri  +  r2  —  n,  where  n  is  the  order 
of  the  matrices  ;  cf.  Exercise  8  at  the  end  of  this  section.  Sylvester  defines  the  nullity 
of  a  matrix  as  the  difference  between  its  order  and  its  rank,  so  that  his  statement  of 
the  law  of  nullity  is  :  The  nullity  of  the  product  of  two  matrices  is  at  least  as  great 
as  the  nullity  of  either  factor,  and  at  most  as  great  as  the  sum  of  the  nullities  of  the 
factors. 
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There  is  one  important  case  in  which  this  theorem  enables  us  to 
determine  completely  the  rank  of  the  product,  namely,  the  case  in 
which  one  of  the  two  matrices  a  or  b  is  non-singular.  Suppose  first 
that  a  is  non-singular,  and  denote  the  ranks  of  b  and  ab  by  r  and  It 
respectively.  By  Theorem  6,  R<r.  We  may,  however,  also  regard 
b  as  the  product  of  a~^  and  ab,  and  hence,  applying  Theorem  6 
again,  we  have  v^R.  Combining  these  two  results,  we  see  that 
r  =  R. 

On  the  other  hand,  if  b  is  non-singular,  and  we  denote  the  ranks 
of  a  and  ab  respectively  by  r  and  jR,  we  get  from  Theorem  Q^  R^r\ 
and,  applying  this  theorem  again  to  the  equation 

(ab)  b-i  =  a, 

we  have  r  ^  R.     Thus  again  we  get  r  =  R. 
We  have  thus  established  the  result : 

Theorem  7.  -If  a  matrix  of  rank  r  is  multiplied  in  either  order  hy 
a  non-singular  matrix^  the  rank  of  the  product  is  also  r, 

EXERCISES 

1.  Prove  that  a  necessary  and  sufficient  condition  that  two  matrices  a  and  b 

of  the  same  order  be  equivalent  is  that  there  exist  two  non-singular  matrices 

c  and  d  such  that  , 

dac  =  b. 

Cf.  §  22,  Exercise  2,  and  §  19,  Exercise  4. 

2.  Prove  that  a  necessary  and  sufficient  condition  that  two  matrices  a  and  b 
of  the  same  order  be  equivalent  is  that  there  exist  four  matrices  c,  d,  e,  f  such 

dac  =  b,        a  =  fbe. 

3.  Prove  that  every  matrix  of  rank  r  can  be  written  as  the  sum  of  r  matrices 
of  rank  one.* 

[Suggestion.  Notice  that  the  special  matrix  mentioned  in  §  19,  Exercise  3,  can  be 
so  written.] 

*  A  matrix  of  rank  one  has  been  called  by  Gibbs  a  dyad,  since  it  may  (cf. 
§  19,  Ex.  5)  be  regarded  as  a  product  of  two  complex  quantities  (ai,  a'2,^  •••  cin)  and 
(6i,  &25  •••  ?>n).  The  sum  of  any  number  of  dyads  is  called  a  dyadic  polynomial,  or 
simply  a  dyadic.  Every  matrix  is  therefore  a  dyadic,  and  vice  versa.  Gibbs's  theory 
of  dyadics,  in  the  case  w  =  3,  is  explained  in  the  Vector  Analysis  of  Gibbs-Wilson, 
Chap.  V.  Geometric  language  is  used  here  exclusively,  the  complex  quantities 
(«!,  a2,  (iz)  and  (&i,  62,  ?>3)  from  which  the  dyads  are  built  up  being  interpreted  as 
vectors  in  space  of  three  dimensions.  This  theory  is  equivalent  to  Hamilton's  theory 
of  the  Linear  Vector  Function  in  Quaternions. 
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4.  Prove  that  a  necessary  and  sufficient  condition  that  a  matrix  be  a  divisor 
of  zero  (cf.  §  22,  Exercise  1)  is  that  it  be  singular. 

[Suggestion.     Consider  equivalent  matrices.] 

5.  Prove  that  the  inverse  of  the  product  of  any  number  of  non-singular 
matrices  is  the  product  of  the  inverses  of  these  matrices  taken  in  the  reverse 
order. 

Hence  deduce  a  similar  theorem  concerning  the  adjoint  of  a  product  of  any 
number  of  matrices,  whether  these  matrices  are  singular  or  not. 
What  theorem  concerning  determinants  can  be  inferred  ? 

6.  Prove  that  the  conjugate  of  the  inverse  of  a  non-singular  matrix  is  the 
inverse  of  the  conjugate ;  and  that  the  conjugate  of  the  adjoint  of  any  matrix 
is  the  adjoint  of  the  conjugate. 

7.  Prove  that  if  a  matrix  has  the  property  that  its  product  with  every  matrix 
of  the  same  order  is  commutative,  it  is  necessarily  a  scalar  matrix. 

8.  If  n  and  r^  are  the  ranks  of  two  matrices  of  order  w,  and  R  the  rank  of 
their  product,  prove  that  »">  * 

[Suggestion.  Prove  this  theorem  first  on  the  supposition  that  one  of  the  two 
matrices  which  are  multiplied  together  is  of  the  form  mentioned  in  Exercise  3,  §  19, 
using  also  at  this  point  Exercise  1,  §  8.  Then  reduce  the  general  case  to  this  one  by 
means  of  Exercise  1  of  this  section.] 


26.  Sets,  Systems,  and  Groups.  These  three  words  are  the 
technical  names  for  conceptions  which  are  to  be  met  with  in  all 
branches  of  mathematics.  In  fact  the  first  two  are  of  such  gener- 
ality that  they  may  be  said  to  form  the  logical  foundation  on  which 
all  mathematics  rests,  f  In  this  section  we  propose,  after  having 
given  a  brief  explanation  of  these  three  conceptions,  to  show  how 
they  apply  to  the  special  subjects  considered  in  this  chapter. 

The  objects  considered  in  mathematics  —  we  use  the  word  object 
in  the  broadest  possible  sense  —  are  of  the  most  varied  kinds.  We 
have,  on  the  one  hand,  to  mention  a  few  of  the  more  important  ones, 
the  different  kinds  of  quantities  ranging  all  the  way  from  the  posi- 
tive integers  to  complex  quantities  and  matrices.  Next  we  have  in 
geometry  not  only  points,  lines,  curves,  and  surfaces  but  also  such 

*  Cf.  the  footnote  to  Theorem  6. 

t  Eor  a  popular  exposition  of  the  point  of  view  here  alluded  to,  see  my  address  on 
The  Fundamental  Conceptions  and  Methods  of  Mathematics^  St.  Louis  Congress  of 
Arts  and  Science,  1904.     Keprinted  in  Bull.  Amer.  Math.  Soc. ,  December,  1904. 
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things  as  displacements  (rotations,  translations,  etc.),  collineations, 
and,  in  fact,  geometrical  transformations  in  general.  Then  in  vari- 
ous parts  of  mathematics  we  have  to  deal  with  the  Theory  of 
Substitutions,  that  is,  with  the  various  changes  which  can  be  made 
in  the  order  of  certain  objects,  and  these  substitutions  themselves 
may  be  regarded  as  objects  of  mathematical  study.  Finally,  in 
mechanics  we  have  to  deal  with  such  objects  as  forces,  couples, 
velocities,  etc. 

These  objects,  and  all  others  which  are  capable  of  mathematical 
consideration,  are  constantly  presenting  themselves  to  us,  not  singly, 
but  in  sets.  Such  sets  (or,  as  they  are  sometimes  called,  classes)  of 
objects  may  consist  of  a  finite  or  an  infinite  number  of  objects,  or 
elements.     We  mention  as  examples  : 

(1)  All  prime  numbers. 

(2)  All  lines  which  meet  two  given  lines  in  space. 

(3)  All  planes  of  symmetry  of  a  given  cube. 

(4)  All  substitutions  which  can  be  performed  on  five  letters. 

(5)  All  rotations  of  a  plane  about  a  given  line  perpendicular  to  it. 

Having  thus  gained  a  slight  idea  of  the  generality  of  the  con- 
ception of  a  set,  we  next  notice  tliat  in  many  cases  in  which  we  have 
to  deal  with  a  set  in  mathematics,  there  are  one  or  more  rules  by 
which  pairs  of  elements  of  the  set  may  be  combined  so  as  to  give 
objects,  either  belonging  to  the  set  or  not  as  the  case  may  be.  As 
examples  of  such  rules  of  combination,  we  mention  addition  and 
multiplication  both  in  ordinary  algebra  and  in  the  algebra  of  ma- 
trices ;  the  process  by  which  two  points,  in  geometry,  determine  a 
line  ;  the  process  of  combining  two  displacements  to  give  another 
displacement,  etc. 

Such  a  set,  with  its  associated  rules  of  combination,  we  will  call 
a  mathematical  system.,  or  simply  a  system.* 

We  come  now  to  a  very  important  kind  of  system  known  as  a 
group^  which  we  define  as  follows  : 

*  This  definition  is  sufficiently  general  for  our  immediate  purposes.  In  general, 
however,  it  is  desirable  to  admit,  not  merely  rules  of  combination,  but  also  relations  be- 
tween the  elements  of  a  system.  In  fact  we  may  have  merely  one  or  more  relations 
and  no  rules  of  combination  at  all.  From  this  point  of  view  the  positive  integers  with 
the  relation  of  greater  and  less  would  form  a  system,  even  though  we  do  not  introduce 
any  rule  of  combination  such  as  addition  or  multiplication.  It  may  be  added  that  rules 
of  combination  may  be  regarded  as  merely  relations  between  three  objects ;  cf.  the 
address  referred  to  above. 
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Definition.  A  system  consisting  of  a  set  of  elements  and  one  rule 
of  combination^  which  we  will  denote  by  o,  is  called  a  group  if  the  follow- 
ing conditions  are  satisfied : 

(1)  If  a  and  b  are  any  elements  of  the  set^  whether  distinct  or  not^ 
aob  is  also  an  element  of  the  set.* 

(2)  The  associative  law  holds ;  that  is^  if  a,  6,  c  are  any  elements 
of  the  set,  {aob)o  c  =  ao{boc). 

(3)  The  set  contains  an  element,  ^,  called  the  identical  element, 
which  is  such  that  every  element  is  unchanged  when  combined  with  it. 


lo  a  =  aoi  =  a. 

(4)   If  a  is  any  element,  the  set  also  contains  an  element  a\  called  the 

a'  o  a  =  a  o  a'  =  i. 


inverse  of  a,  such  that  ^i  ^  ^  _  ^  ^  ^f 


Thus,  for  example,  the  positive  and  negative  integers  with  zero 
form  a  group  if  the  rule  of  combination  is  addition.  In  this  case 
zero  is  the  identical  element,  and  the  inverse  of  any  element  is  its 
negative.  These  same  elements,  however,  do  not  form  a  group  if 
the  rule  of  combination  is  multiplication,  for  while  conditions  (1), 
(2),  and  (3)  are  fulfilled  (the  identical  element  being  1  in  this  case), 
condition  (4)  is  not,  since  zero  has  no  reciprocal. 

Again,  the  set  of  all  real  numbers  forms  a  group  if  the  rule 
of  combination  is  addition,  but  not  if  it  is  multiplication,  since  in 
this  case  zero  has  no  inverse.  If  we  exclude  zero  from  the  set,  we 
have  a  group  if  the  rule  of  combination  is  multiplication,  but  not  if 
it  is  addition. 

As  an  example  of  a  group  with  a  finite  number  of  elements  we 
mention  the  four  numbers 


+  1,  -1,  +V-1,  -V-1 

with  multiplication  as  the  law  of  combination. 

In  order  to  get  an  example  of  a  group  of  geometrical  operations, 
let  us  consider  the  translations  of  a  plane,  regarded  as  a  rigid  lamina, 
in  the  directions  of  its  own  lines.  Ever}^  such  translation  may  be 
represented  both  in  magnitude  and  in  direction  by  the  length  and 

*  A  system  satisfying  condition  (1)  is  sometimes  said  to  have  "the  group  prop- 
erty." In  the  older  works  on  the  subject  this  condition  was  the  only  one  to  be 
explicitly  mentioned,  the  others,  however,  being  tacitly  assumed. 
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direction  of  an  arrow  lying  in  the  plane  in  question.  Two  such 
translations  performed  in  succession  are  obviously  equivalent  to  a 
translation  of  the  same  sort  represented  by  the  arrow  obtained  by 
combining  the  two  given  arrows  according  to  the  law  of  the  paral- 
lelogram of  forces.  The  set  of  all  translations  with  the  law  of 
combination  just  explained  is  readily  seen  to  form  a  group  if  we 
include  in  it  the  null  translation^  i.e.  the  transformation  which  leaves 
every  point  in  the  plane  fixed.  This  null  translation  is  then  the 
identical  element,  and  two  translations  are  the  inverse  of  each  other 
if  they  are  equal  in  magnitude  and  opposite  in  direction. 

All  the  groups  we  have  so  far  mentioned  satisfy,  not  only  the 
four  conditions  stated  in  the  definition,  but  also  a  fifth  condition, 
viz.  that  the  law  of  combination  is  commutative.  Such  groups  are 
called  commutative  or  Ahelian  groups.  In  general,  however,  groups 
do  not  have  this  property.  As  examples  of  non-Abelian  groups, 
we  may  mention  first  the  group  of  all  non-singular  matrices  of  a 
given  order,  the  rule  of  combination  being  multiplication  ;  and 
secondly  the  group  of  all  matrices  of  a  given  order  whose  deter- 
minants have  the  value  ±1,  the  rule  of  combination  being  again 
multiplication.  This  second  group  is  called  a  subgroup  of  the  first, 
since  all  its  elements  are  also  elements  of  the  first  group,  and  the 
rule  of  combination  is  the  same  in  both  cases.  A  subgroup  of  the 
group  last  mentioned  is  the  group  of  all  matrices  of  a  given  order 
whose  determinants  have  the  value  -f  1,*  the  rule  of  combination 
being  multiplication. 

We  add  that  non-Abelian  groups  may  readily  be  built  up  whose 
elements  are  linear  transformations,  or  collineations.  On  the  other 
hand,  Abelian  groups  may  be  formed  from  matrices  if  we  take  as  our 
rule  of  combination  addition  instead  of  multiplication. 

27.  Isomorphism. 

Definition.  Two  groups  are  said  to  he  isomorphic^  if  it  is  possible 
to  establish  a  one-to-one  correspondence  between  their  elements  of  such  a 

*  These  are  called  unimodular  matrices  ;  or,  more  accurately,  properly  unimodular 
matrices  to  distinguish  them  from  the  improperly  unimodular  matrices  whose  determi- 
nants have  the  value  —  1.  It  should  be  noticed  that  these  last  matrices  taken  by  them- 
selves do  not  constitute  a  group,  since  they  do  not  even  have  the  group  property. 

t  Simply  isomorphic  would  be  the  more  complete  term.  We  shall,  however,  not 
be  concerned  with  isomorphism  which  is  not  simple. 
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sort  that  if  a^  h  are  any  elements  of  the  first  group  and  a\  V  the  corre- 
sponding elements  of  the  second^  then  a'  o  h'  corresponds  to  aob,* 

We  proceed  to  illustrate  this  definition  by  some  examples, 
leaving  to  the  reader  the  proofs  of  the  statements  we  make.  In  each 
case  we  omit  the  statement  of  the  rule  of  combination  in  the  case  of 
transformations,  where  no  misunderstanding  is  possible. 

First  Example,     (a)  The  group  of  the  four  elements 


1,  V-1,  -1,  -V-1, 

the  rule  of  combination  being  multiplication. 

(h)  The  group  of  four  rotations  about  a  given  line  through  angles 
of  0°,  90°,  180°,  270°. 

These  two  groups  may  be  proved  to  be  isomorphic  by  pairing  the 
elements  against  one  another  in  the  order  in  which  they  have  just 
been  written. 

Second  Example,  (a)  The  group  of  the  four  matrices 
1     0\  f-1         0\  (1         OW-1     0 

0  i)\  0  ~i;Ao   -vA  0  1 

the  rule  of  combination  being  multiplication. 

{h)  The  group  of  the  following  four  transformations  :  the  iden- 
tical transformation  ;  reflection  in  a  plane  ;  reflection  in  a  second 
plane  at  right  angles  to  the  first ;  rotation  through  180°  about  the 
line  of  intersection  of  these  two  planes. 

(c)  The  group  consisting  of  the  identical  transformation  and  of 
three  rotations  through  angles  of  180°  about  three  straight  lines 
through  a  point  at  right  angles  to  one  another. 

The  two  groups  of  Example  1  are  not  isomorphic  with  the  three  of 
Example  2  in  spite  of  the  fact  that  there  are  the  same  number  of  ele- 
ments in  all  the  groups.  This  follows  from  the  presence  of  two 
elements  in  the  groups  of  Example  1  whose  squares  are  not  the 
identical  element. 

*  This  idea  of  isomorphism  may  obviously  be  extended  to  the  case  of  any  two  sys- 
tems provided  merely  that  there  are  the  same  number  of  rules  of  combination  in  both 
cases.  Thus  the  system  of  all  scalar  matrices  on  the  one  hand  and  of  all  scalars  on  the 
other,  the  rules  of  combination  being  in  both  cases  addition  and  multiplication,  are  ob- 
viously isomorphic.  It  is  for  this  reason  that  no  confusion  arises  if  no  distinction  is 
made  between  scalar  matrices  and  scalars. 
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Third  Example,  (a)  The  group  of  all  real  quantities ;  the  rule 
of  combination  being  addition. 

(b)  The  group  of  all  scalar  matrices  of  order  k  with  real  ele- 
ments ;  the  rule  of  combination  being  addition. 

(c)  The  group  of  all  translations  of  space  parallel  to  a  given 
line. 

Fourth  Example,  {a)  The  group  of  all  non-singular  matrices 
of  order  n,  with  multiplication  as  the  rule  of  combination. 

(5)  The  group  of  all  non-singular  homogeneous  linear  transfor- 
mations in  n  variables. 

We  might  be  tempted  to  mention  as  a  group  of  geometrical  trans- 
formations isomorphic  with  the  last  two  groups,  the  group  of  all 
non-singular  collineations  in  space  of  n  —  1  dimensions.  This,  how- 
ever, would  be  incorrect,  for  the  correspondence  we  have  established 
between  collineations  and  linear  transformations  is  not  one-to-one ; 
to  every  linear  transformation  corresponds  one  collineation,  but  to 
every  collineation  correspond  an  infinite  number  of  linear  transfor- 
mations, whose  coefficients  are  proportional  to  one  another.*  In 
order  to  get  a  group  of  geometrical  transformations  isomorphic  with 
the  group  of  non-singular  matrices  of  the  nth  order  it  is  sufficient 
to  interpret  the  variables  x^^  •••  x^  as  non-homogeneous  coordinates  in 
space  of  n  dimensions,  and  to  consider  the  geometric  transformation 
effected  by  non-singular  homogeneous  linear  transformations  of 
these  x's.  These  transformations  are  those  affine  transformations  of 
space  of  n  dimensions  which  leave  the  origin  unchanged  ;  cf.  the 
footnote  on  p.  70.  Thus  the  group  of  all  non-singular  matrices  of 
the  nth  order  is  isomorphic  with  a  certain  subgroup  of  the  group 
of  collineations  in  space  of  n  dimensions,  not  with  the  group  of  all 
non-singular  collineations  in  space  oi  n  —  1  dimensions. 

An  essential  difference  between  these  two  groups  is  that  one 

*  This  does  not  really  prove  that  the  groups  are  not  isomorphic,  since  it  is  con- 
ceivable that  some  other  correspondence  might  be  established  between  their  elements 
which  would  be  one-to-one  and  of  such  a  sort  as  to  prove  isomorphism.  Even  the 
fact,  to  be  pointed  out  presently,  that  the  groups  depend  on  a  different  number  of 
parameters  does  not  settle  the  question.  A  reference  to  the  result  stated  in  Exercise  7, 
§25,  shows  that  the  groups  are  not  isomorphic;  for,  according  to  it,  the  only  non- 
singular  collineation  which  is  commutative  with  all  collineations  is  the  identical 
transformation,  whereas  all  linear  transformations  with  scalar  matrices  have  this 
property. 
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depends  on  r^  parameters  (the  r^  coefficients  of  the  linear  transforma- 
tion) while  the  other  depends  only  on  m^  —  1  parameters  (the  ratios 
of  the  coefficients  of  the  collineation). 

We  can,  however,  by  looking  at  the  subject  a  little  differently, 
obtain  a  group  of  matrices  isomorphic  with  the  group  of  all  non- 
singular  collineations  in  space  of  n  —  1  dimensions.  For  this  purpose 
we  need  merely  to  regard  two  matrices  as  equal  whenever  the  ele- 
ments of  one  can  be  obtained  from  those  of  the  other  by  multiplying 
all  the  elements  by  the  same  quantity  not  zero.  When  we  take  this 
point  of  view  with  regard  to  matrices,  it  is  desirable  to  indicate  it 
by  a  new  terminology  and  notation.  According  to  a  suggestion  of 
E.  H.  Moore  of  Chicago,  we  will  call  such  matrices  fractional 
matrices,  and  write  them 


,  etc. 


Agreeing  that  fractional  matrices  are  to  be  added  and  mul- 
tiplied according  to  the  same  rules  as  ordinary  matrices,  we  may  now 
say  that  the  group  of  all  non-singular  collineations  in  space  Q>i  n—\ 
dimensions  is  isomorphic  with  the  group  of  all  fractional  matrices  of 
the  n\h  order  whose  determinants  are  not  zero.* 

To  take  another  example,  the  groups  in  the  second  example  above 
are  isomorphic  with  the  group  whose  elements  are  the  four  fractional 
matrices 


«11 

^12 

t 

H\ 

^12 

^13 

H\ 

^22 

^23 

H\ 

^22 

H\ 

%2 

^33 

1     0 


0     1 


1 


0 


0     -1 


0     1 


1     0 


0     1 


1     0 


and  where  the  law  of  combination  is  multiplication.  These  four 
matrices,  if  regarded  as  ordinary  matrices,  would  not  even  satisfy 
the  first  condition  for  a  group. 

The  reader  wishing  to  get  a  further  insight  into  the  theory  of 
groups  of  linear  transformations  will  find  the  following  three  treat- 

*  It  should  be  noticed  that  we  cannot  speak  of  the  value  of  the  determinant  of  a 
fractional  matrix  unless  this  value  is  zero,  for  if  we  multiply  all  the  elements  of  the 
matrix  by  c  we  do  not  change  the  matrix,  but  do  multiply  the  determinant  by  C*. 
There  is  in  particular  no  such  thing  as  a  unimodular  fractional  matrix.  We  may, 
however,  speak  of  the  rank  of  a  fractional  matrix. 
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ments   interesting   and   instructive.     They  duplicate  each,  other  to 

only  a  very  slight  extent. 
Weber,  Algebra^  Vol.  II. 
Klein,  Vorlesungen  uber  das  Ikosaeder, 
Lie-Scheffers,  Vorlesungen  iiher  continuirliche  G-ruppen. 

EXERCISES 

1.  Definition.  A  group  is  said  to  he  of  order  n  if  it  contains  n,  and  only  n, 
elements. 

If  a  group  of  order  n  has  a  subgroup,  prove  that  the  order  of  this  subgroup  is 
a  factor  of  n. 

[Suggestion.  Denote  the  elements  of  the  subgroup  by  ai  •••  ak-,  and  let  6  be  any 
other  element  of  the  group.  Show  that  bai,  ba^,  •••  bajc  are  all  elements  of  the  group 
distinct  from  each  other  and  distinct  from  the  a's.  If  there  are  still  other  elements, 
let  c  be  one  and  consider  the  elements  cai,  •••  ca^,  etc.] 

2.  Prove  that  if  a  is  any  element  of  a  group  of  finite  order,  it  is  possible  by 
multiplying  a  by  itself  a  sufficient  number  of  times  to  get  the  identical  element. 

Definition.  The  lowest  power  to  which  a  can  be  raised  so  as  to  give  the  identical 
element  is  called  the  period  of  a. 

3.  Prove  that  every  element  of  a  group  of  order  n  has  as  its  period  a  factor 
of  n  (1  and  n  included). 

4.  Definition.  A  group  is  called  cyclic  if  all  its  elements  are  powers  of  a 
single  element. 

Prove  that  all  cyclic  groups  of  order  n  are  isomorphic  with  the  group  of  rota- 
tions about  an  axis  through  angles  0,  w,  2  to,  •••  (n  —  1)  w,  where  o>  =  2  ir/n,  and 
that  conversely  every  such  group  of  rotations  is  a  cyclic  group. 

5.  Prove  that  every  group  whose  order  is  a  prime  number  is  a  cyclic  group. 

6.  Prove  that  all  groups  of  order  4  are  either  cyclic  or  isomorphic  with  the 
groups  of  the  second  example  above.  A  group  of  this  last  kind  is  called  a  fours 
group  (  Vierergruppe^ . 

7.  Obtain  groups  with  regard  to  one  or  the  other  of  which  all  groups  of 
order  6  are  isomorphic. 

8.  Obtain  groups  with  regard  to  one  or  the  other  of  which  all  groups  of 
order  8  are  isomorphic. 


CHAPTER   VII 
INVARIANTS.     FIRST  PRINCIPLES  AND  ILLUSTRATIONS 

28.  Absolute   Invariants,    Geometric,  Algebraic,  and  Arithmetical. 

If  we  subject  a  geometric  figure  to  a  transformation,  we  find  that, 
while  many  properties  of  the  figure  have  been  altered,  others  have 
not.  If  we  consider,  not  a  single  transformation,  but  a  set  of  trans- 
formations, then  those  properties  of  figures  which  are  not  changed  by 
any  of  the  transformations  of  the  set  are  said  to  be  invariant  prop- 
erties with  regard  to  this  set  of  transformations.  Thus  if  our  set  of 
transformations  is  the  group  of  all  displacements,  the  property  of 
two  lines  being  parallel  or  perpendicular  to  each  other  and  the 
property  of  a  curve  being  a  circle  are  invariant  properties,  since 
after  the  transformation  the  lines  will  still  be  parallel  or  perpendicu- 
lar and  the  curve  will  still  be  a  circle.  If,  however,  we  consider, 
not  the  group  of  displacements,  but  the  group  of  all  non-singular 
collineations,  none  of  the  properties  just  mentioned  will  be  invariant 
properties.  Properties  invariant  with  regard  to  all  non-singular 
collineations  have  played  such  an  important  part  in  the  development 
of  geometry  that  a  special  name  has  been  given  to  them,  and  they 
are  called  projective  or  descriptive  properties.  As  examples  of  such 
projective  properties  we  mention  the  collinearity  and  complanarity 
of  points,  the  complanarity  and  concurrence  of  lines,  etc.;  or,  on 
the  other  hand,  the  contact  of  a  line  with  a  curve  or  a  surface  or 
the  contact  of  two  curves  or  of  two  surfaces,  or  of  a  curve  and  a 
surface. 

Definition  1.  If  there  is  associated  with  a  geometric  figure  a 
quantity  which  is  unchanged  hy  all  the  transformations  of  a  certain 
set^  then  this  quantity  is  called  an  invariant  with  regard  to  the  trans- 
formations of  the  set. 

For  instance,  if  our  set  of  transformations  is  the  group  of  dis- 
placements, the  distance  between  two  points  and  the  angle  between 
two  lines  would  be  two  examples  of  invariants. 
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The  geometric  invariants  so  far  considered  lead  up  naturally  to 
the  subject  of  algebraic  invariants.     Thus  let  us  consider  the  two 


polynomials 
(1) 


A^x  +  B^y  +  (72, 


(2) 


and  subject  the  variables  {x,  y)  to  the  transformations  of  the  set 

x'  =x  cos 0  -\-  y  ^mO  +  a, 
y'  z=  —X  sin  B  +  y  cos  6  -\-  fS^ 

where  a,  /3,  6  are  parameters  which  may  have  any  values.  The  trans- 
formation (2)  carries  over  the  polynomials  (1)  into  two  new  poly- 
nomials : 

' A[x' -^  B\y^ -\- C[, 

A'^x' -h  B'^y' -{- C^, 

The  coefficients  of  (3)  may  be  readily  expressed  in  terms  of  the  coeffi- 
cients of  (1)  and  the  parameters  a,  jS,  6.  Using  these  expressions, 
we  easily  obtain  the  formulae 

'A[B'^  -  A'^B[  =  A^B^  -  A^B^, 

\a[A^-\-B[B^  =  A,A^+B,B^, 


(3) 


(4) 


We  shall  therefore  speak  of  the  two  expressions 
(5)  A^B^-A^B,,     ^1^2 +^1^2 

as  invariants  of  the  system  of  polynomials  (1)  with  regard  to  the  set 
of  transformations  (2)  according  to  the  following  general  definition  : 

Definition  2.  If  we  have  a  system  of  polynomials  in  the  variables 
{x^  y^  z^  ...)  and  a  set  of  transformations  of  these  variables^  then  any 
function  of  the  coefficients  of  the  polynomials  is  called  an  invariant  {or 
more  accurately  an  absolute  invariant)  with  regard  to  these  transforma- 
tions if  it  is  unchanged  when  the  polynomials  are  subjected  to  all  the 
transformations  of  the  set. 

The  relation  of  the  example  considered  above  to  the  subject  of 
geometric  invariants  becomes  obvious  when  we  notice  that  the  alge- 
braic transformations  (2)  may  be  regarded  as  expressing  the  dis- 
placements of  plane  figures  in  their  plane  when  {x^  y)  are  rectangular 
coordinates  of  points  in  the  plane.  If  now  we  consider,  not  the  poly- 
nomials (1),  but  the  two  lines  determined  by  setting  them  equal  to 
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zero,  we  have  to  deal  with  the  displacements  of  these  two  lines. 
The  invariants  (5)  have  themselves  no  geometric  significance,  but  by- 
equating  them  to  zero,  we  get  the  necessary  and  sufficient  conditions 
that  the  two  lines  be  respectively  parallel  and  perpendicular,  and 
these,  as  we  noticed  above,  are  invariant  properties  with  regard  to 
displacements.  Finally  we  may  notice  that  the  ratio  of  the  two  in- 
variants (5)  gives  the  tangent  of  the  angle  between  the  lines, — a 
geometric  invariant. 

As  a  second  example,  let  us  consider,  not  two  lines,  but  a  line  and 
a  point.     Algebraically  this  means  that  we  start  with  the  system 


(6) 


Ax  +  Bi/-{-  (7, 


consisting  of  a  polynomial  and  a  pair  of  variables.  We  shall  wish 
to  demand  here  that  whenever  the  variables  (x^  y)  are  subjected  to  a 
transformation,  the  variables  (iCj,  y^  be  subjected  to  the  same  trans- 
formation, or  as  we  say  according  to  Definition  3  below,  that  (a;,  y) 
and  (ajj,  y^  be  cogredient  variables.  If  we  subject  the  system  (6) 
to  any  transformation  of  the  set  (2),  we  get  a  new  system 


(7) 


and  it  is  readily  seen  that 

Mx[  +  By[  +a  =  Ax^  +  By^+  (7. 

Accordingly  we  shall  call  Ax^  +  By^  -\-  C  a>  covariant  of  the  system  (6) 
according  to  Definition  4  below.  This  covariant  has  also  no  direct 
geometric  meaning,  but  its  vanishing  gives  the  necessary  and  suffi- 
cient  condition  for  an  invariant  property,  namely,  that  the  point 

(p^v  Vi)  ^^^  ^^  ^^^®  ^^^^  ^^  +  ^y  +  ^=  0. 

In  the  light  of  this  example  we  may  lay  down  the  following  gen- 
eral definitions : 

Definition  3.    If  we  have  several  sets  of  variables 

and  agree  that  whenever  one  of  these  sets  is  subjected  to  a  transfor- 
mation every  other  set  shall  be  subjected  to  the  same  transformation^ 
then  we  say  that  we  have  sets  of  cogredient  variables. 
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Definitio:n^  4.  If  we  have  a  system  consisting  of  a  number  of  poly- 
nomials in  {x^  y,  2,  •••)  and  of  a  number  of  sets  of  variables  co- 
gredient  to  (x^  y^  z^  •••),  tlien  any  function  of  the  coefficients  of  the 
polynomials  and  of  the  cogredient  variables  which  is  unchanged  when 
the  variables  (x^  y^  ^^  ••*)  <^^^  subjected  to  all  the  transformations  of  a 
certain  set  is  called  a  covariant  {or  more  accurately  an  absolute  cova- 
riant)  of  this  system  with  regard  to  the  transformations  of  this  set. 

It  will  be  seen  that  invariants  may  be  regarded  as  special  cases  of 
CO  variants. 

Among  the  geometric  invariants  there  are  some  which  from  their 
nature  are  necessarily  integers,  and  which  we  will  speak  of  as  arith- 
metical invariants.  An  example  would  be  the  number  of  vertices  of 
a  polygon  if  our  set  of  transformations  was  either  the  group  of  dis- 
placements or  the  group  of  non-singular  collineations.  Another  ex- 
ample is  the  largest  number  of  real  points  in  which  an  algebraic 
curve  can  be  cut  by  a  line,  if  our  set  of  transformations  is  the  group 
of  real  non-singular  collineations. 

These  arithmetical  invariants  also  play,  as  we  shall  see,  an  impor- 
tant part  in  algebra.  We  mention  here  as  an  example  the  degree  of  an 
n-ary  form,  which  is  an  invariant  with  regard  to  all  non-singular  linear 
transformations.  * 

EXERCISES 

1.  Prove  that     (x^  —  xiy  +  (^2  —  yi)%  and 

are  covariants  of  the  system 

(^1,  yO,  (^2, 3/2),  (xs,  2/3) 

with  regard  to  the  transformations  (2). 

2.  Prove  that  A -{■  B  and  B'^-AC 
are  invariants  of  the  polynomial 

Ax^  +  2  Bxy  -{-  Cy'^-h2Dx-\-2Ey+F 
with  regard  to  the  transformations  (2). 

What  geometric  meaning  can  be  attached  to  these  invariants? 

3.  Prove  that  A^  -\-  B'^  is  am  invariant  of  the  polynomial 

Ax-\-  By-h  C 
with  regard  to  the  transformations  (2). 

Hence  show  that  Axi-{-Byi  +_C 

is  a  covariant  of  the  system  (6).     Note  its  geometric  meaning. 

*  It  is,  in  fact,  an  invariant  with  regard  to  all  linear  transformations  except  the 
one  in  which  all  the  coefficients  of  the  transformation  are  zero. 


^1  yi 

1 

X2    y2 

1 

xs   yz 

l| 
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29.   Equivalence. 

Definition  1.  If  A  cf^nd  B  denote  two  geometric  configurations 
or  two  algebraic  expressions^  or  sets  of  expressions^  then  A  and  B  shall 
he  said  to  be  equivalent  with  regard  to  a  certain  set  of  transformations 
when^  and  only  when^  there  exists  a  transformation  of  the  set  which  car- 
ries over  A  into  B  and  also  a  transformation  of  the  set  which  carries  over 
B  into  A. 

To  illustrate  this  definition  we  notice  that  the  conception  of  equiv- 
alence of  geometric  figures  with  regard  to  displacements  is  identical 
with  the  Euclidean  conception  of  the  equality  or  congruence  of  fig- 
ures. 

Again,  we  see  from  Theorem  2,  §  24,  that  on  a  straight  line  two 
sets  of  three  points  each  are  ahvays  equivalent  with  regard  to  non- 
singular  projective  transformations. 

In  both  of  the  illustrations  just  mentioned  the  set  of  transforma- 
tions forms  a  group.  In  such  cases  the  condition  for  equivalence  can 
be  decidedly  simplified,  for  the  transformation  which  carries  A  into 
B  has  an  inverse  belonging  to  the  set,  and  this  inverse  necessarily 
carries  B  into  A.     Thus  we  have  the 

Theorem.  A  necessary  and  sufficient  condition  for  the  equiva- 
lence of  A  and  B  with  regard  to  a  group  of  transformations  is  that  a 
transformation  of  the  group  carry  over  A  into  B. 

This  theorem  will  be  of  great  importance,  as  the  question  of 
equivalence  will  present  itself  to  us  only  when  the  set  of  transfor- 
mations we  are  considering  forms  a  group. 

Let  us  consider,  for  the  sake  of  greater  definiteness,  a  group  of 
geometric  transformations.  If  two  geometric  configurations  are 
equivalent  with  regard  to  this  group,  every  invariant  of  the  first 
configuration  must  be  equal  to  the  corresponding  invariant  of  the 
second.  Thus,  for  instance,  if  two  triangles  are  equivalent  with  re- 
gard to  the  group  of  displacements,  all  the  sides  and  angles  of  the 
first  will  be  equal  to  the  corresponding  sides  and  angles  of  the  second. 
The  same  will  be  true  of  the  altitudes,  lengths  of  the  medial  lines, 
radius  of  the  inscribed  circle,  etc.,  all  of  these  being  invariants. 
Now  one  of  the  first  problems  in  geometry  is  to  pick  out  from  among 
these  invariants  of  the  triangle  as  small  a  number  as  possible  whose 
equality  for  two  triangles  insures  the  equivalence  of  the  triangles. 
This  can  be  done,  for  instance,  by  taking  two  sides  and  the  included 
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angle,  or  two  angles  and  the  included  side,  or  three  sides.  Any  one 
of  these  three  elements  may  be  called  a  complete  system  of  invarimits 
for  a  triangle  with  regard  to  the  group  of  displacements,  since  two 
triangles  having  these  invariants  in  common  are  equivalent  and 
therefore  have  all  other  invariants  in  common.  The  conception  we 
have  here  illustrated  may  be  defined  in  general  terms  as  follows  : 

Definition  2.  A  set  of  invariants  of  a  geometric  configuration  or  an 
algebraic  expression  are  said  to  form  a  complete  system  of  invariants  if 
two  configurations  or  expressions  having  these  invariants  in  common  are 
necessarily  equivalent.* 

It  will  be  seen  from  this  definition  that  all  the  invariants  of  a 
geometric  configuration  or  of  an  algebraic  expression  are  uniquely 
determined  by  any  complete  system  of  invariants. 

Finally  we  will  glance  at  an  application  to  matrices  of  the  ideas 
of  invariants  and  equivalence.  Let  us  consider  matrices  of  the  nth. 
order,!  and  consider  transformations  of  the  following  form  which 
transform  the  matrix  A  into  the  matrix  B: 

(1)  aAb  =  B, 

where  a  and  b  are  any  non-singular  matrices  of  the  nth  order.  This 
transformation  may  be  denoted  by  the  symbol  (a,  b),  and  these  sym- 
bols must  obviously  be  combined  by  the  formula 

(a2,  bgXa^,  bi)  =  (a2ai,  bib2). 

By  means  of  this  formula  it  may  readily  be  shown  that  these  trans- 
formations form  a  group. 

According  to  our  general  definition  of  equivalence,  two  matrices 
A  and  B  must  therefore  be  said  to  be  equivalent  when,  and  only 
when,  two  non-singular  matrices  a  and  b  exist  which  satisfy  (1). 
That  this  definition  of  equivalence  amounts  to  the  same  thing  as  our 
earlier  definition  is  seen  by  a  reference  to  Exercise  1,  §  25. 

*  In  the  classical  theory  of  algebraic  invariants  this  term  is  used  in  a  different  and 
much  more  restricted  sense.  There  we  have  to  deal  with  integral  rational  relative  inva- 
riants (cf.  §  31).  By  a  complete  system  of  such  invariants  of  a  system  of  algebraic  forms 
is  there  understood  a  set  of  such  invariants  in  terms  of  which  every  invariant  of  this  sort 
of  the  system  of  forms  can  be  expressed  integrally  and  rationally.  Cf.  for  instance 
Clebsch,  Bindre  Formen,  p.  109. 

t  We  may,  if  we  choose,  confine  our  attention  throughout  to  matrices  with  real 
elements. 
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30.  The  Rank  of  a  System  of  Points  or  a  System  of  Linear  Forms 
as  an  Invariant.  Let  {x-^,  y-^,  z^,  t-^,  {x^,  y^,  z^,  t^,  {x^,  3/3,  z^,  t^)  be 
any  three  distinct  collinear  points,  so  that  the  rank  of  the  matrix 


x^ 

X. 


Vz 


U 


is  two.  Now  subject  space  to  a  non-singular  collineation  and  we  get 
three  new  points  which  will  also  be  distinct  and  collinear,  and  hence  the 
rank  of  their  matrix  will  also  be  two.  Thus  we  see  that  in  this  special 
case  the  rank  of  the  system  of  points  is  unchanged  by  a  non-singular 
collineation. 

Again,  let  a^x  -f-  h^y  -f-  c^z  -}-  d^t  =  0, 

a^x  -t-  b^y  +  c^z  -{-  d^t  =  0, 

a^x  -\-  h^y  -h  e^z  -j-  d^t  =  0, 

a^x  +  h^y  +  c^z  +  c?^^  =  0 

be  any  four  planes  which  liave  one,  and  only  one,  point  in  common, 
so  that  the  rank  of  their  matrix  is  three.  After  a  non-singular  collin- 
eation we  have  four  new  planes  which  will  also  have  one,  and  only 
one,  point  in  common,  and  hence  the  rank  of  the  matrix  of  their 
coefficients  will  be  three.  The  rank  of  this  system  of  planes  is 
therefore  unchanged  by  such  a  transformation. 
We  proceed  now  to  generalize  these  facts. 


Theorem  1.      The  rank  of  the  matrix  of  m  points 

{xf,xf,-xVi),  (i=l,  2,  . 

is  an  invariant  with  regard  to  non-singular  linear  transformations. 
Let 
(1) 


m) 


be  a  non-singular  linear  transformation  which  carries  the  points 
{x^i^  •••  x^^^  over  into  the  points  (Xf^,  •••  X^^).  Now  suppose  any  k 
of  the  points  (2;^^,  •••  a:{^^),  which  for  convenience  we  will  take  as  the 
first  ^,  are  linearly  dependent.  Then  there  exist  k  constants 
(<?j,  •••  Cj^  not  all  zero,  such  that 

(2)  c^x>^  4-  c^x'!  -K  . . .  +  c,xf^  =  0,  (j  =  1,  2,  .. .  n). 
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By  means  of  the  transformation  (1)  we  have 


X^?  =  c,,xf  + 


ji-^i 


4-  r-  Q-^^ 


hence 


•  •  •  +  ^yn(^i4  +  <^2^n  +  •  •  •  +  ^;fc4^0  0'  =  1.  2,  . . .  7l). 


Since  this  vanishes  on  account  of  (2),  the  first  k  of  the  points 
{Xf,"'Xl^^)  are  linearly  dependent.  Since  (1)  is  a  non-singular 
transformation,  it  is  immaterial  which  set  of  points  we  consider  as  the 
initial  set.  Thus  we  have  shown  that  if  any  k  points  of  either  set 
are  linearly  dependent,  the  corresponding  k  points  of  the  other  set 
will  be,  also. 

Now  if  the  rank  of  the  matrix  of  the  x's  is  r,  at  least  one  set  of  r 
of  the  a;-points  is  linearly  independent,  but  every  set  of  (r+1)  of 
them  is  linearly  dependent.  Consequently  the  same  is  true  for  the 
X-points,  and  therefore  their  matrix  must  also  be  of  rank  r. 

Theorem  2.     The  rank  of  the  matrix  of  m  linear  forms 

fii^V  ' ' '  ^n)  -  ^il^i  +  ^i2^2  -^ ^  ^in^n  (*'  =  1,   2,  •  •  •  m) 

is  an  invariant  with  regard  to  non- singular  linear  transformations. 

The  proof  of  this  theorem,  which  is  very  similar  to  the  proof  of 
Theorem  1,  we  leave  to  the  reader. 

It  will  be  noticed  that  the  invariants  we  have  considered  in  this 
section  are  examples  of  what  we  have  called  arithmetical  invariants. 

31.  Relative  Invariants  and  Covariants.  We  will  begin  by  con- 
sidering a  system  of  n  linear  forms  in  n  variables 


(1) 


^21  "^1  "I    ^22  -^2  "f"  " 

"1    ^In-^nt 
'  +  ^2»  ^»' 

Definition  1.    The  determinant 


a 


11 


a 


m 


^»l  *  * '  ^nn 


is  called  the  resultant  of  the  system  (1). 
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Let  us  now  subject  the  system  (1)  to  the  linear  transformation 


(2) 


This  gives  the  new  system  of  forms 


(3) 


where 


^11  ^1  +  •  •  •  +  ^m  ^r 


From  these  formulae  and  the  law  of  multiplication  of  matrices  we 


infer  that 
(4) 


a 


11 


a 


m 


a: 


m 


Cli„ 


11 


a 


\n 


a 


n\'" 


a. 


11 


"nl 


'in 


This  result  we  state  as  follows  : 

Theorem  1.  If  a  system  of  n  linear  forms  in  n  variables  with 
matrix  a  is  subjected  to  a  linear  transformation  with  matrix  c,  the  re- 
sulting system  has  as  its  matrix  ac. 

Taking  the  determinants  of  both  sides  of  (4),  we  see  that  the  re- 
sultant of  (1)  is  not  an  absolute  invariant.  It  is,  however,  changed 
in  only  a  very  simple  manner  by  a  linear  transformation,  namely,  by 
being  multiplied  by  the  determinant  of  the  transformation.  This 
leads  us  to  the  following  definition : 

DEFiKiTioisr  2.  A  rational  function  *  of  the  coefficients  of  a  form  or 
system  of  forms  which^  when  these  forms  are  subjected  to  any  non- 
singular  linear  transformation^  is  merely  multiplied  by  the  fith  power 
(//-  an  integer  f)  of  the  determinant  of  the  transformation  is  called  a  rela- 
tive invariant  of  weight  /ii  of  the  form  or  system  of  forms.  J  The  forms 
themselves  are  called  the  ground-forms. 

*  Besides  these  rational  invariants  we  may  also  consider  irrational  ones  (cf.  §  90), 
in  which  case  the  exponent  /x  will  not  necessarily  be  an  integer. 

t  The  condition  that  ix  be  an  integer  need  not  be  included  as  a  part  of  our 
hypothesis,  since  it  may  be  proved.  The  proof  that  /x  cannot  be  a  fraction  is  simple. 
The  proof  that  [x  cannot  be  irrational  or  imaginary  would  take  us  outside  of  the  domain 
of  algebra. 

X  From  this  definition  it  is  clear  that  every  relative  invariant  is  an  absolute  invariant 
withregardtothegroupof  linear  transformations  of  determinant  +1.   Cf.  Exercise?,  §81. 


INVARIANTS.    FIRST  PRINCIPLES   AND  ILLUSTRATIONS      97 

It  will  be  seen  that  absolute  invariants  are  simply  relative  in- 
variants of  weight  zero. 

The  fact  pointed  out  above  concerning  the  resultant  may  now  be 
stated  in  the  following  form  : 

Theorem  2.  The  resultant  of  a  set  of  n  linear  forms  in  n  variables 
is  a  relative  invariant  of  weight  1. 

We  pass  on  now  to  relative  covariants  : 

Definition  3.  If  we  have  a  system  consisting  of  a  number  of  n-ary 
forms  and  of  a  number  of  points  (3/^, ...  ^„),  {z^, ...  2„), ... ,  the  coor- 
dinates of  each  of  which  are  cogredient  with  the  variables  {x-^^ . . .  a;„)  of 
the  forms  ^  then  any  rational  function  of  the  coefficients  of  the  forms  and 
the  coordinates  of  the  points  which  is  merely  multiplied  by  the  fith  power 
(^fi  an  integer)  of  the  determinant  of  the  transformation  when  the  xs  are 
subjected  to  any  non-singular  linear  transformation  is  called  a  relative 
eovariant  of  weight  fi  of  the  system  of  forms  and  points.* 

We  may  regard  an  invariant  as  the  extreme  case  of  a  eovariant 
where  the  number  of  points"  is  zero.  The  other  extreme  case  is  that 
in  which  the  number  of  forms  is  zero.     Here  we  have  the  theorem: 

Theokem  3.      The  determinant 

'tW  ...  'tW 

is  a  relative  eovariant  of  weight  —  1  of  the  system,  of  points 
For  applying  the  transformation 


^n  —  ^nl^l  +    •••    H~  (^'iin^ni 

*  In  most  books  where  the  subject  of  covariants  is  treated,  the  same  letters 
(xi,  •  •  •  Xn)  are  used  for  one  of  the  points  as  for  the  variables  of  the  forms.  There  is 
no  objection  to  this,  and  it  is  sometimes  convenient.  We  prefer  to  use  a  notation 
which  shall  make  it  perfectly  clear  that  the  variables  of  the  forms  have  no  connection 
with  the  coordinates  of  the  points  except  that  they  are  cogredient  with  them. 

H 
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we  have 


^'    . . .  ^' 


/rN  ...  rW 


CiiX(   +...+e^„X:,    --c^i^l    +-+c„„Xi 


c„XW+-  +  <.,„XW 


''„i^iW  + 


11      ^m 


^n\  "'  ^nn 


^1     '" ^n 


Xf"J...Xf"^ 


Or 


-^1      •  •  •  ^n 


^11  '  *  •  ^'in 


o       .  . .  /> 


^1         *  •  *  2^M 


rM  ...  ^An] 


as  was  to  be  proved. 

Another  extremely  simple  case  arises  when  we  have  a  single 
form  and  a  single  point: 

Theorem  4.  The  system  consisting  of  the  formfix^^  ---^n)  ^^^ 
the  point  (?/i,  •  •  •  y^)  has  as  an  absolute  covariant  with  regard  to  linear 
transformations  .  .  . 

fKVv  "-yn)' 

For  let  us  denote  /  more  explicitly  as 

J  {a^,  (^2,  •  • ' ;  Xy,  •  • ' Xfi), 

where  a^  a^^  •••  are  the  coefficients  of/.     If  the  coefficients  after  the 
transformation  are  a'^  a'^^  ••>,  we  have 

J  yay,  a^t  *"  5  ^\i  **■  ^n)  ^^^J  v^i'  ^2'    '*  '  ^v  '  '  "^n)* 

This  being  true  for  all  values  of  the  a;'s,  will  be  true  if  the  a^'s  are 
replaced  by  the  y's.  But  when  this  is  done,  the  x'^s  will  be  replaced 
by  the  «/''s,  since  the  re's  and  ^'s  are  cogredient.     Accordingly 

f(a[,  4,  ...  ;  y'y  •'•y'^)=f  {a^,  a^,  ...  ;  y^,  ...y„), 

as  was  to  be  proved. 
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The  three  examples  of  invariants  and  covariants  which  have  been 
given  in  this  section  are  all  polynomials  in  the  coefficients  of  the 
forms  and  the  coordinates  of  the  points.  Such  invariants  we  shall 
speak  of  as  integral  rational  invariants  and  covariants.* 

Theorem  5.  The  weight  of  an  integral  rational  invariant  cannot 
he  negative.^ 

Let  ^j,  ^2'  •*•  5  ^i^^2'"'  '  **'  ^®  ^^^  coefficients  of  the  system  of 
forms,  and  let  c^j  be  the  coefficients  of  the  transformation.  It  is  clear 
that  the  coefficients  a\,  a'^,----,  5j,  Jg^---;  •••  after  the  transformation  are 
polynomials  in  the  as,  5's,  etc.,  and  in  the  c^/s.  Now  let  I  be  an 
integral  rational  invariant  of  weight  /i, 

7(aJ,  4,  •••  ;  h[,  5|,  •..  ;    -"):=  c''I{a^,  a^,  •••  ;  h^,  h^,---;  •••), 

where  e  is  the  determinant  of  the  transformation.      Suppose  now 
that  fjL  were  negative,  /jb=  —  v.     Then 

(5)  c^/«---;  ^^i,  •••;  ...)  =  /(«!,  •••;  K -"  ;  •••)• 

This  equality,  like  the  preceding  one,  is  known  to  hold  for  all 
values  of  the  c^/s  for  which  c  ^0.  Hence,  since  the  expressions  on 
both  sides  of  the  equality  are  polynomials  in  the  a's,  J's,  •  •  •  and  the 
c^/s^  we  infer,  by  an  application  of  Theorem  5,  §  2,  that  we  really 
have  an  identity. 

Let  us  now  assign  to  the  a's,  5's,  ••  any  constant  values  such  that 
I(a^^"'  ;  5j,  ...  ;  ...)^0.  Then  /(aj,-.;  b[,  •••  ;  •••)  will  be  a  poly- 
nomial in  the  c^/s  alone,  which,  from  (5),  cannot  be  identically  zero. 
The  identity  (5)  thus  takes  a  form  which  states  that  the  product  of 
two  polynomials  in  the  c^/s  is  a  constant,  and  since  the  first  of  these 
polynomials,  c'',  is  of  higher  degree  than  zero,  this  is  impossible. 

We  will  agree  in  future  to  understand  by  the  terms  invariant  and 
covarianU  invariants  or  covariants  (absolute,  relative,  or  arithmetical) 
with  regard  to  all  non- singular  linear  transformations.  If  we  wish  to 
consider  invariants  or  covariants  with  regard  to  other  sets  of  trans- 
formations, for  instance  with  regard  to  real  linear  transformations, 
this  fact  will  be  explicitly  mencioned. 

*  All  rational  invariants  and  covariants  may  be  formed  as  the  quotients  of  such  as 
are  integral  and  rational ;  cf.  Exercises  4,  5,  §  78. 
t  It  cannot  be  zero  either ;  cf.  Theorem  5,  §79. 
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Finally,  let  us  note  the  geometric  meaning  to  be  associated  with 
the  invariants  and  covariants  which  have  been  mentioned  in  this 
section.  We  confine  our  attention  to  the  case  of  four  variables. 
The  vanishing  of  the  resultant  of  four  linear  forms  gives  a  necessary 
and  sufficient  condition  that  the  four  planes  determined  by  setting 
the  forms  equal  to  zero  meet  in  a  point.  The  vanishing  of  the  co- 
variant  of  Theorem  3  is  a  necessary  and  sufficient  condition  that  the 
four  points  lie  in  a  plane.  The  vanishing  of  the  covariant  of 
Theorem  4  is  a  necessary  and  sufficient  condition  that  the  point 
(^1'  ^2'  ^3'  yd  ^^^  ^^  ^^^  surface  /=  0.  It  will  be  seen  that  in  all 
cases  we  are  thus  led  to  a  projective  property  ;  cf.  §§80,  81. 

32.   Some  Theorems  Concerning  Linear  Forms. 

Theorem  1.  Two  systems  of  n  linear  forms  in  n  variables  are 
equivalent  with  regard  to  non-singular  linear  transformations  if  neither 
resultant  is  zero. 

Let  {  a,^x,  -h  '••+  a^r,x„.  {  h^^x^  + h  h^^x,^ 


(1) 


(2) 


be  the  two  systems,  whose  resultants. 


^nl^l  H V^i 


^*t/nf 


a  = 


^ii"'*^i« 


b  = 


^ir'-hn 


h^^-"h, 


are,  by  hypothesis,  not  zero.     Applying  the  transformations 


a  { 


x^  —  ^11*^1  ~r  '*■  ~r  ^inP^w 


Xi  —  ^11^1  +  •  •  •  +  t>ijiX, 


.  •^/i  —  ^nl*^l  T~  ■  ■ '  ~r  ^nn'^n'> 


nf 


X 


n  —  ^nl-^1  +  •  •  *  "r  ^nrv^ni 


to  (1)  and  (2)  respectively,  they  are  both  reduced  to  the  normal  form 
(3) 


f       I 


^2' 


•  • 


^n* 


Now,  since  neither  a  nor  h  is  zero,  the  transformations  a  and  b  have 
inverses,  which  when  applied  to  (3)  carry  it  back  into  (1)  and  (2) 
respectively.     Hence  the  transformation  b~^a  carries  (1)  into  (2). 
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Theorem  2.  A  system  of  n  linear  forms  in  n  variables  has  no 
integral  rational  invariants  *  other  than  constant  multiples  of  powers 
of  the  resultant. 

Let  (1)  be  the  given  system  and  a  its  resultant,  and  let  c  be  the 
determinant  of  a  non-singular  linear  transformation  which  carries 
(1)  over  into  [  a[,x[  +  ■  ■  ■  +  a[X, 

(4)  :  :  :  :  :  ; 

If  we  call  the  resultant  of  (4)  a\  we  have 

a^  =  ac. 

Let  /(«ii,  •••  a„„)  be  any  integral  rational  invariant  of  the  system 
(1)  of  weight  /i,  and  write 

Then  I  =  c^L 

Now  let  us  assume  for  a  moment  that  a  ^  0 ,  and  consider  the  special 

transformation  which  carries  over  (1)  into  the  normal  form  (3).     In 

this  special  case  we  have  a'  =  1;  hence,  as  may  also  be  seen  directly, 

ac  =  l.     Calling  the  constant  value  which  I'  has  in  this  particular 

case  k,  we  have  ,  -r  -r 

k  =  c^I=  a~^I, 

or 

(5)  1=  ka^. 

This  equality,  in  which  k  is  independent  of  the  coefficients  a^^  has 
been  established  so  far  merely  for  values  of  the  a./s  for  which  a  =7^  0. 
Since  yu,  is  not  negative  (cf.  Theorem  5,  §  31),  we  can  now  infer 
that  (5)  is  an  identity,  by  making  use  of  Theorem  5,  §  2.  Thus  we 
see  that  I  is  merely  a  constant  multiple  of  a  power  of  the  resultant, 
as  was  to  be  proved. 

Corollary.  A  system  of  m  linear  forms  in  n  variables  has  no  in- 
tegral rational  invariants  (other  than  constants)  when  m  <i  n. 

For  such  an  invariant  would  also  be  an  integral  rational  invariant 
of  the  system  of  n  linear  forms  obtained  by  adding  n  —  m  new  forms  to 
the  given  system ;  and  hence  it  would  be  a  constant  multiple  of  a 

*  It  has  the  arithmetical  invariant  mentioned  in  Theorem  2,  §  30. 
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power  of  the  resultant  of  tliis  system.  This  power  must  be  zero, 
and  hence  the  invariant  must  be  a  mere  constant,  as  otherwise  it 
would  involve  the  coefficients  of  the  added  forms,  and  hence  would 
not  be  an  invariant  of  the  system  of  original  forms. 

EXERCISES 

1.  Prove  that  if  we  have  two  systems  of  n  -f  1  linear  forms  in  n  variables  whose 
matrices  are  both  of  rank  n,  a  necessary  and  sufficient  condition  that  these  two 
systems  be  equivalent  with  regard  to  non-singular  linear  transformation  is  that 
the  resultants  of  the  forms  of  one  set  taken  n  at  a  time  be  proportional  to  the 
resultants  of  the  corresponding  forms  of  the  other  set. 

2.  Generalize  the  preceding  theorem. 

3.  Prove  that  every  integral  rational  invariant  of  a  system  of  m  linear  forms  in 
n  variables  (m  >  ii)  is  a  homogeneous  polynomial  in  the  resultants  of  these  forms 
taken  n  at  a  time. 

4.  State  and  prove  the  tlieorems  analogous  to  the  theorems  of  the  present  sec- 
tion, including  the  three  preceding  exercises,  when  the  system  of  linear  forms  is  re- 
placed by  a  system  of  points. 

33.  Cross-ratio  and  Harmonic  Division.  Let  us  consider  any  four 
distinct  points  on  a  line 

V^)  \^V  ^IP  \*^2'  ^2)'  \'^3'  ^3)'  \*^4'  ^4)* 

We  have  seen,  in  §  31,  Theorem  3,  that  each  of  the  six  determinants 

^3^4        •^4^3'  ^4^2        *^2  4'  *^2  3        '^3'^2' 

is  a  covariant  of  weight  —  1.  The  ratio  of  two  of  these  determinants 
is  therefore  an  absolute  covariant,  and  we  might  be  tempted,  by 
analogy  with  the  examples  of  absolute  covariants  in  Exercise  1,  §  28, 
to  expect  that  it  might  have  a  geometric  meaning.  It  will  be  readily 
seen,  however,  that  this  is  not  the  case,  for  the  value  of  the  ratio  of 
two  of  the  determinants  (2)  will  be  changed  if  the  two  coordinates 
of  one  of  the  four  points  are  multiplied  by  the  same  constant,  and 
this  does  not  affect  the  position  of  the  points. 

It  is  easy,  however,  to  avoid  this  state  of  affairs  by  forming  such 
an  expression  as  the  following:* 

/g\  n    2  3  4^  =  (^1^2 ~  ^2^1)  (^3^4  ~  ^4^3)^ 

\^2^3~  ^3^2)  (^4^1  "~  ^1^4) 

*  The  reversal  of  sign  of  the  second  factor  in  the  denominator  is  not  essential,  but 
is  customary  for  a  reason  which  will  presently  be  evident. 
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which  is  also  an  absolute  co variant  of  the  four  points  (1),  and  is  called 
their  cross-ratio  or  anharmonic  ratio.  More  accurately  it  is  called  the 
cross-ratio  of  these  four  points  when  taken  in  the  order  written  in  (1).* 
In  order  to  determine  the  geometric  meaning  of  the  cross-ratio 
of  four  points,  let  us  first  .suppose  the  four  points  to  be  finite  so  that 
t^t^t^t^=^0.  Dividing  numerator  and  denominator  of  (1,  2,  3,  4)  by 
this  product,  we  find  the  following  expression  for  the  cross-ratio  in 
terms  of  the  non-homogeneous  coordinates  JQ  of  the  points, 

(4)  (1,  2,  3,  4)_  -^_^^^^_^^. 
Finally,  denoting  the  points  by  P^,  Pg'  ^3'  -^4'  ^^  ^^7  write 

(5)  (1,  2,  3,  ^)-p^J  F^-T^pJ  iVP3- 

In  words,  this  formula  tells  us  that  the  cross-ratio  of  four  finite 
points  is  the  ratio  of  the  ratio  in  which  the  second  divides  the  first 
and  third  and  the  ratio  in  which  the  fourth  divides  the  first  and 
third;  and  that  it  is  also  the  ratio  of  the  ratios  in  which  the  first 
and  third  divide  the  second  and  fourth. 

In  this  statement,  it  must  be  remembered  that  we  iiave  taken  the 
ratio  in  which  (J  divides  the  points  ^,  B  as  AC / BC,  so  that  the  ratio 
is  negative  if  C  divides  AB  internally,  positive  if  it  divides  it  externally. 

If  we  agree  that  the  point  at  infinity  on  a  line  shall  be  said  to 
divide  any  two  finite  points  A^  B  of  this  line  in  the  ratio  + 1  (and  this 
is  a  natural  convention  since  the  more  distant  a  point  the  more  nearly 
does  it  divide  AB  in  the  ratio  -fl)  it  is  readily  seen,  by  going  back 
to  formula  (3),  that  the  first  statement  following  (5)  still  holds  if  the 
second  or  fourth  point  is  at  infinity,  while  the  second  statement  holds 
if  the  first  or  third  is  at  infinity.  Thus  we  have  in  all  cases  a  simple 
geometric  interpretation  of  the  cross-ratio  of  four  distinct  points. 

The  special  case  in  which  four  points  P^,  P^^  P3,  P^  are  so  situ- 
ated that  (1,  2,  3,  4)  =  —  1  is  of  peculiar  importance.  In  this  case 
we  have 

(1,  2,  3,  4)  =  (1,  4,  3,  2)  =  (3,  2,  1,  4)  =  (3,  4,  1,  2)  =  (2,  1,  4,  3) 

=  (2,  3,  4,  1)  =  (4,  1,  2,  3)  =  (4,  3,  2,  1)=-1. 

*  If  these  four  points  are  taken  in  other  orders,  we  get  different  cross-ratios  : 
(1,  2,  4,  S),  (1,  4,  3,  2),  etc.     Cf.  Exercise  1  at  the  end  of  this  section 
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The  relation  is  therefore  merely  a  relation  between  the  two  pairs  of 
points  Pj,  P3  and  P^,  P^  taken  indifferently  in  either  order,  and  we 
say  that  these  two  pairs  of  points  divide  each  other  harmonically. 
From  the  geometric  meaning  of  cross-ratio,  we  see  that,  if  all  four 
points  are  finite,  the  pairs  P^,  Pg  and  P^,  P^  divide  each  other  har- 
monically when,  and  only  when,  P^  and  P^  divide  Pj,  P3  internally 
and  externally  in  the  same  ratio  ;  and  also  when,  and  only  when,  Pj 
and  Pg  divide  Pg,  P4  internally  and  externally  in  the  same  ratio.  If 
P2  or  P^  lies  at  infinity,  the  first  of  these  statements  alone  has  a 
meaning,  while  if  Pj  or  Pg  lies  at  infinity,  it  is  the  second  statement 
to  which  we  must  confine  ourselves. 

It  is  easily  seen  that  the  case  in  which  three  of  the  four  points, 
say  Pj,  Pg,  Pg,  coincide,  while  P^  is  any  point  on  the  line,  may  be 
regarded  as  a  limiting  form  of  two  pairs  of  points  which  separate  one 
another  harmonically.  It  is  convenient  to  include  this  case  under  the 
term  harmonic  division^  and  we  will  therefore  lay  down  the  definition : 

Definition.  Two  pairs  of  points  P-^,  P^  and  P^,  P^  on  a  line  are 
said  to  divide  one  another  harnfionically  if  they  are  distinct  and  their 
cross-ratio  taken  in  the  order  Pj,  Pg,  Pg,  P4  is  —  1,  and  also  if  at  least 
three  of  them  coincide. 

It  will  be  seen  that  the  property  of  two  pairs  of  points  dividing 
each  other  harmonically  is  a  projective  property  in  space  of  one 
dimension. 

The  most  important  applications  of  cross-ratio  come  in  geometry 
of  two,  three,  or  more  dimensions  where  the  points  are  not  determined 
as  above  by  two  coordinates  (or  one  non-homogeneous  coordinate),  but 
by  more.  Suppose,  for  instance,  we  have  four  distinct  finite  points 
on  a  line  in  space  of  three  dimensions.  Let  the  points  be  P^,  Pg, 
^j,  §2'  ^^^  suppose  the  coordinates  of  Pj,  Pg  are  (a:^,  y^,  Zy,  t^  and 
(a^g,  ^g,  ^g,  ^2)  respectively.  Then  the  coordinates  of  Q^  Q^  may  be 
written 

{x^  -f  Xx^,y^  4-  >-y2'^i  +^^2'^i  +  ^^2)'(^i  +  t^^vVi  +f^I/2'>h  +  f^^vh  +  f^hy 

Now,  let 

(6)  Ax'\'By+  Cz-{-Dt  =  0 

be  any  plane  through  Q^  but  not  through  Pg,  and  we  have 

{Ax^  4-  By^  -h  Cz^  -f  Bfi)  +  X(Ax^  h-  P^g  -f  Oz^  -h  Bt^)  =  0, 
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or,  since  P^  does  not  lie  on  (6), 

Ax^  -h  By^  +  Cz^  +  Bt^ 

Changing  to  non-homogeneous  coordinates,  we  have 

AX^  +  BY^-^CZ^  +  B  _        t^ 
AX^  +  BY^+  CZ^  +  B~        t^' 

If  PiM^  and  P^^M^  are  the  perpendiculars  from  Pj  and  P^  on  the 
plane  (6),  we  have 

P^Q^^P^M^^AX^-VBY^^CZ^^B  ^        t^ 
P^Q^     P^M^     AX^  +  BY^+OZ^  +  B  t^' 

In  exactly  the  same  way  we  get 
Consequently  ^iQi  /^iQ^^^ 

P^J  T^^-J 

This  is  the  cross-ratio  of  the  four  points  taken  in  the  order  P^,  Q^^ 

-^2'    ^2* 

It  is  readily  seen  that  if  one  of  the  two  points  Q-^  or  Q^  lies  at  in- 
finity, all  that  is  essential  in  the  above  reasoning  remains  valid,  and 
the  cross-ratio  is  still  \/\x. 

The  case  in  which  one  of  the  two  points  P^  or  P^  is  at  infinity 
may  be  reduced  to  the  case  just  considered  by  writing  for  the  coor- 
dinates of  Q^  and  Q^^  (f^,  t)^^  f^,  Tj)  and  (fg,  '^2'  ?2'  '^2)-  "^^^  coordi- 
nates of  Pj  and  P2  are  then 

(?l-^?2^  ^l-^^2^   fl-^^2'  ^^"^^V' 
(?1  -  f  2'  ^1  -  ^2^   ?1  -  ^2^   "^l  -  '^2)- 

Accordingly,  from  what  has  just  been  proved,  we  see  that  the 
cross-ratio  of  the  four  points  taken  in  the  order  Q^^  Pj,  Q^^  P^  is  X//-t. 
But  this  change  of  order  does  not  change  the  cross-ratio.  Hence  in 
all  cases  we  have  the  result : 
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Theorem  1.    The  cross-ratio  of  the  four  distinct  points 
-t  1         (^11  y\->  ^11  ^i)» 

-t  2  \*^2'  ^2'  ^2""  ^2)' 

Q^         (x^  +  Xa^g,  ^1  +  X?/2,  ^1  4-  X22,  ^1  +  X^2)» 

taken  in  the  order  Py,  Q^,  P^^  Q^^  is  X//i. 

From  this  theorem  we  easily  deduce  the  further  result : 

Theorem  2,  The  cross-ratio  of  four  points  on  a  line  is  invariant 
with  regard  to  non-singular  collineations  of  space.* 

For  the  four  points  P^,  P^,  Q^^  Q^  of  Theorem  1  are  carried  over 
by  a  non-singular  collineation  into  the  four  points 

Pi       \^ii  y\^  ^1^  ^1)1 

-*^2  V*^2'  ^2^  ^2'     2/' 

Q[         {x[  4-  X4,  y[  4-  Xy^,  4  4-  X2^,  t[  4-  X^^), 

^2         (4  +  /^4'  ^i  +  W2'  4  +  /^^2'  ^i  +  /^4)' 
whose  cross-ratio,  when  taken  in  the  order  P^,  ^^  P'^,  Q'^^  is  also  X/ft. 

Theorems  similar  to  Theorems  1  and  2  hold  in  space  of  two,  and  in 
general  in  space  of  w,  dimensions  and  may  be  proved  in  the  same  way. 

EXERCISES 

1.  Denote  the  six  determinants  (2)  by 

(1,2),        (1,3),        (1,4),        (3,4),        (4,2),        (2,3), 

and  write 

A  =  (1,  2)(3,  4),        B  =  (1,  3)(4,  2),         C=  (1,  4)(2,  3). 

Prove  that  six,  and  only  six,  cross-ratios  can  be  formed  from  four  points  by 
taking  them  in  different  orders,  namely  the  negatives  of  the  six  ratios  which  can  be 
formed  from  A,  B,  C  taken  two  and  two. 

2.  Prove  that  A  -\-  B  +  C  =  0,  and  hence  show  that  if  A.  is  one  of  the  cross- 
ratios  of  four  points,  the  other  five  will  be 

1     1      X         1         X-1         X 

X'      '^'  rrx'  "aT'  r::"i* 

*  This  also  follows  from  Exercise  6,  §  24. 
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3.  Prove  that  the  six  cross -ratios  of  four  distinct  points  are  all  different  from 
each  other  except  in  the  following  two  cases : 

(a)  The  case  of  four  harmonic  points,  where  the  values  of  the  cross-ratios 
are  -  1,  2,  i. 

(P)  The  case  known  as  four  equianharmonic  points,  in  which  the  values  of 
the  cross-ratios  are  —  2  ±  2  V  —  3. 

4.  Prove  Theorem  2,  §  24,  by  making  use  of  the  fact  that  the  cross-ratio  of  four 
points  on  a  line  is  unchanged  by  non-singular  projective  transformations  of  the 
line. 

5.  By  the  cross-ratio  of  four  planes  which  meet  in  a  line  is  understood  the 
cross-ratio  of  the  four  points  in  which  these  planes  are  met  by  any  line  which 
does  not  meet  their  line  of  intersection. 

Justify  this  definition  by  proving  that  if  the  equations  of  the  four  planes  are 

Pi  =  0»  J»i  +  V2  =  ^^P2  =  0'  Pi  +  t^P2  =  ^ 

(pj  and/?2  homogeneous  linear  polynomials  in  x,  y,  z,  t),  the  cross-ratio  of  the  four 
points  in  which  any  line  which  does  not  meet  the  line  of  intersection  of  the  planes 
is  met  by  the  planes  is  A/  fx. 

6.  Prove  that. the  cross-ratio  of  four  planes  which  meet  in  a  line  is  invariant 
with  regard  to  non-singular  coUineations. 

34.     Plane-Coordinates^  and  Contragredient  Variables.     If  w^,  Wg, 

Wg,  u^  are  constants,  and  x^^  x^-,  x^^  x^  are  the  homogeneous  coordinates 
of  a  point  in  space,  the  equation 

(1)  u^x^  +  i^2^2  +  ^^s  +  '^A  —  ^ 

represents  a  plane.  Since  the  values  of  the  u^  determine  the  posi- 
tion of  this  plane,  the  u's  may  be  regarded  as  coordinates  of  the  plane. 
We  will  speak  of  them  2i^  plane-coordinates^  just  as  the  a;'s  (each  set 
of  which  determines  a  point)  are  called  point-coordinates.  And  just 
as  we  speak  of  the  point  {x^^  x^^  x^,  x^  so  we  will  speak  of  the  plane 
(^6^,  ^2,  %,  u^.  The  u^  are  evidently  analogous  to  homogeneous 
coordinates  in  that  if  they  be  all  multiplied  by  the  same  constant, 
the  plane  which  they  determine  is  not  changed. 

Suppose  now  that  we  consider  the  re's  as  constants  and  allow  the 
«*'s  to  vary,  taking  on  all  possible  sets  of  values  which,  with  the  fixed 
set  of  values  of  the  a^'s,  satisfy  (1).  This  equation  will  now  repre- 
sent a  family  of  planes,  infinite  in  number,  each  one  of  which  is  de- 
termined by  a  particular  set  of  values  of  the  2^'s  and  all  of  which  pass 
through  the  fixed  point  (a:^,  x^,  x^.  x^.  The  equation  (1)  may  there- 
fore be  regarded  as  the  equation  of  a  point  in  plane-coordinates^  since 
it  is  satisfied  by  the  coordinates  of  a  moving  plane  which  envelops 
this  point,  just  as  when  the  xs  vary  and  the  it's  are  constant,  it  is 
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the  equation  of  a  plane  in  point-coordinates,  since  it  is  satisfied  by  the 
coordinates  of  a  moving  point  whose  locus  is  the  plane.* 

In  the  same  way,  a  homogeneous  equation  of  degree  higher  than 
the  first  in  the  w's  will  be  satisfied  by  the  coordinates  of  a  moving 
plane  which  will,  in  general,  envelop  a  surface.  The  equation  will 
then  be  called  the  equation  of  this  surface  in  plane-coordinates,  f 

Let  us  now  subject  space  to  the  collineation 

C  X^  =  C^^X^  -f-  Ci^X^  +  (^i^^^  ~r  0^^^  \l  =  I,  J,  O,  4j. 

We  will  assume  that  the  determinant  c  of  this  transformation  is  not 
zero  ;  and  we  will  denote  the  cofactors  in  this  determinant  by  C^j, 
Then  the  inverse  of  the  transformation  c  may  be  written 

c-i  x^  =  ^x'  +  ^'xl,  +  -^4  +  ^x[  (i  =  1,  2,  3,  4). 

ecce 

Substituting  these  expressions,  we  see  that  the  plane  (1)  goes  over 
into 

(2)  u[x[  +  u^xl^  4-  ^^3  +  ^4^4  =  0, 

^^e^e  r         r         r         r 

d  ui=^u,  +  ^u^  +  ^u^  +  ^%4  (^  =  1,  2,  3,  4). 

c  e  e  c 

We  thus  see  that  the  us  have  also  suffered  a  linear  transformation, 
though  a  different  one  from  the  a:'s,  namely,  the  transformation  whose 
matrix  is  the  conjugate  (cf .  §  7,  Definition  2)  of  c~^.  This  transforma- 
tion d  of  the  plane-coordinates  is  merely  another  way  of  expressing 
the  collineation  which  we  have  commonly  expressed  by  the  transfor- 
mation c  of  the  point-coordinates.  The  two  sets  of  variables  x  and  u 
are  called  contragredient  variables  according  to  the  following 

Definition  1.  Two  sets  of  n  variables  each  are  called  contragre- 
dient if  whenever  one  is  subjected  to  a  non-singular  linear  transformation^ 
the  other  is  subjected  to  the  transformation  which  has  as  its  matrix  the 
conjugate  of  the  inverse  of  the  matrix  of  the  first, 

*  Similarly,  in  two  dimensions,  the  equation 

uiXi  4-  U2X2  +  UsXs  =  0 

represents  a  line  in  the  point-coordinate  (xi,  x^-,  xz)  if  Wi,  W2,  W3  are  constants,  or  a 
point  in  the  line-coordinates  (wi,  W2,  W3)  if  Xi,  x^,  Xz  are  constants, 
t  An  example  of  this  will  be  found  in  §  53. 
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Precisely  the  reasoning  used  above  in  the  case  of  four  variables 
establishes  here  also  the  theorem  ; 

Theorem.*  If  the  two  sets  of  contragredient  variables  rr^,  •••  x^ 
and.  u^^  '"  u^  are  carried  over  by  a  linear  transformation  into  x[^  •  •  •  x'f^ 
and  u[^  •••  u'n^  then 

will  go  over  into  u[x[  +  u'^x'^  +  •  •  •  +  u^x^. 

In  connection  with  this  subject  of  contragredient  variables  it  is 
customary  to  introduce  the  conception  of  contravariants^  just  as  the 
conception  of  covariants  was  introduced  in  connection  with  the  sub- 
ject of  cogredient  variables.     For  this  purpose  we  lay  down  the 

Definition  2.  If  we  have  a  system  of  forms  in  {x^^  •••  x^  and 
also  a  number  of  sets  of  variables,  {u[, -"  u'^),  {u'l^  •••  u^l),  •••,  contra- 
gredient to  the  x's^  any  rational  function  of  the  u>s  and  the  coefficients  of 
the  forms  which  is  unchanged  by  a  non-singular  linear  transformation  of 
the  x's  except  for  being  multiplied  by  the  ^th  power  (/jl  an  integer)  of  the 
determinant  of  this  transformation  is  called  a  contravariant  of  weight  fi. 

Thus  the  theorem  that  the  resultant  of  n  linear  forms  in  n  variables 
is  an  invariant  of  weight  1  may,  if  we  prefer,  be  stated  in  the  form : 
If  we  have  n  sets  of  n  variables  each,  (u[^  •••  t^J^),  •••  {u^{^\  •••  i^Jf^), 
each  of  which  is  contragredient  to  the  variables  (:rp  •••  a;^),  the  de- 
terminant of  the  us  is  a  contravariant  of  weight  l.f 

It  will  be  seen  that  the  conception  of  contravariant,  though 
sometimes  convenient,  is  unnecessary,  since  the  contragredient  vari- 
ables may  always  be  regarded  as  the  coefficients  of  linear  forms,  and, 
when  so  regarded,  the  contravariant  is  merely  an  invariant. 

Similarly,  the  still  more  general  conception  of  mixed  concomitants, 
in  which,  besides  the  coefficients  of  forms  and  the  contragredient 
variables,  certain  sets  of  cogredient  variables  are  involved,  f  reduces 
to  the  familiar  conception  of  covariants  if  we  regard  the  contra- 
gredient variables  as  coefficients  of  linear  forms. 

*  This  is  really  a  special  theorem  in  the  theory  of  bilinear  forms.  Cf.  the  next 
chapter. 

t  For  other  examples  of  contravariants  in  which  coefficients  also  occur,  see 
Chap.  XII. 

J  An  example  is  uiXi  +  u^xi  +  •••  +  w,tX„,  the  theorem  above  stating  that  this  is  an 
absolute  mixed  concomitant. 
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35.  Line-Coordinates  in  Space.  A  line  is  determined  by  two 
points  (?/p  ^2'  Vz^  ^4)'  (^r  ^v  %  ^4)  which  lie  on  it.  It  is  clear  that 
these  eight  coordinates  are  not  all  necessary  to  determine  the  line  ; 
and  it  will  be  seen  presently  that  the  following  six  quantities  deter- 
mine the  line  completely,  and  may  be  used  as  line-coordinates^ 


where 

(1) 


i^l2i   P\Z->   Pu^  Psi'   P42'>   i^23' 


Pv  = 


Vi    Vj 


In  other  words,  the  jo's  are  the  two-rowed  determinants  of  the  matrix 

yi    Vi    Vz    y^ 

^1     ^2     ^3     ^4 

except  that  the  sign  of  the  determinant  obtained  by  striking  out 
the  first  and  third  column  has  been  changed.     These  six  jt?'s  are  not 
all  zero  if,  as  we  assume,  the  two  points  y  and  z  are  distinct. 
These  six  jo's  are  connected  by  the  relation 

(2)  PviPz^  +  i?l3i^42  +  i^l4i^23  =  ^'* 

as    may  be    seen   either    directly  or    by  expanding    the   vanishing 
determinant  y^    y^    y^    y^ 

^1        ^2        ^3        ^4 

y\   y^  ^3   ^4 

^1    ^2    %    ^' 

by  Laplace's  method  in  terms  of  the  minors  of  the  first  two  rows. 

That  the  jo's  may  really  be  used  as  line-coordinates  is  shown  by 
the  following  two  theorems: 

Theorem  1.  When  a  line  is  given^  its  line-coordinates  py  are  com- 
pletely determined  except  for  an  arbitrary  factor  different  from  zero  by 
which  they  may  all  be  multiplied. 

The  definition  (1)  of  the  jo's  shows  that  they  may  all  be  multi- 
plied by  an  arbitrary  factor  different  from  zero  without  affecting 
the  position  of  the  line,  since  the  y's  (and  also  the  55's)  may  be  multi- 
plied by  such  a  factor  without  affecting  the  position  of  the  point. 


*  Cf .  Exercise  2,  §  33. 


INVARIANTS.    FIRST  PRINCIPLES   AND  ILLUSTRATIONS      111 

In  order  to  prove  our  theorem  it  is  sufficient  to  show  that,  if 
instead  of  the  two  points  used  above  for  determining  the  ^'s  we  use 
two  other  points  of  the  line, 

the  line-coordinates 


y^i 

Yi 

p  = 

% 

J 

tj 

Zi 

Zi 

thus  determined  will  be  proportional  to  the  jt?y's.  Since  the  points 
Y  and  Z  are  collinear  with  the  distinct  points  ^,  2,  they  are  linearly- 
dependent  upon  them  and  we  may  write 


^i  ~  ^iVi'^  *^2  ^i'>  ^i  —  ^1  J/i  +  ^^2  ^i 


{i  =  1,  2,  3,  4). 


Accordingly 


^ii  = 

^1 

^2 

^1 

^2 

V 

h 

^2 

h 

^2 

=  Kp 


iji 


where  K4^  0,  as  Y  and  Z  are  distinct  points. 

Theorem  2.  Any  six  constants  p^j  satisfying  the  relation  (2)  and 
not  all  zero  are  the  line-coordinates  of  one,  and  only  one,  line. 

That  they  cannot  be  the  coordinates  of  more  than  one  line  may  be 
seen  as  follows:  Suppose  the  pi/s  to  be  the  coordinates  of  a  line, 
and  take  two  distinct  points  y  and  z  on  the  line.  The  coordinates 
of  these  points  may  then  be  so  determined  that  relations  (1)  hold. 
Let  us  suppose,  for  definiteness,  that  p^^  =^  0.*  Now,  consider  the 
point  whose  coordinates  are  c-^y^-^- c^z^.  By  assigning  to  (?j  and  c^ 
first  the  values  —  z^  and  y-^,  then  the  values  —  z^  and  y^^  we  get  the 
two  points 

(3)  (0'  Pit  Pi3^  Pu)^         (P2V  ^'  P2S^  Pu)^ 

where,  by  definition,  p^j  =  —  Pji- 

These  two  points  are  distinct,  since  for  the  first  of  them  the  first 
coordinate  is  zero  and  the  second  is  not,  while  for  the  second  the 
second  coordinate  is  zero  and  the  first  is  not.  These  points  ac- 
cordingly determine  the  line,  and  since  they,  in  turn,  are  deter- 
mined by  the  jt?'s,  we  see  that  the  line  is  uniquely  determined 
by  the  p's. 

*  By  a  slight  modification  of  the    formulse  this  proof  will  apply  to  the  case  in 
which  any  one  of  the  p's  is  assumed  different  from  zero. 
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It  remains,  then,  merely  to  show  that  any  set  of  joey's,  not  all  zero, 
which  satisfy  (2)  really  determine  a  line.  For  this  purpose  we  again 
assume  jp-^^,  ^^  ^  *  ^^^  consider  the  two  points  (3)  which,  as  above,  are 
distinct.     The  line  determined  by  them  has  as  its  coordinates 

i^l2'    P12P1Z''    P12PW    ~  Pl3P42~  PuP2S'    ^^12^42'    ^^12^23- 

Using  the  relation  (2),  the  fourth  of  these  quantities  reduces  to 
F12F34'  ^^  that,  remembering  that  the  coordinates  of  a  line  may  be 
multiplied  by  any  constant  different  from  zero,  we  see  that  we 
really  have  a  line  whose  coordinates  are  p^j. 

In  a  systematic  study  of  three-dimensional  geometry  these  line- 
coordinates  play  as  important  a  part  as  the  point-  or  plane-coordi- 
nates ;  and  in  the  allied  algebraic  theories  we  shall  have  to  consider 
expressions  having  the  invariant  property,  into  which  these  line- 
coordinates  enter  just  as  point-coordinates  occur  in  covariants  and 
plane-coordinates  in  contravariants.  We  may,  if  we  please,  regard 
these  expressions  as  ordinary  covariants,  since  the  line-coordinates 
are  merely  functions  of  the  coordinates  of  two  points,  but  the  co- 
variants  we  get  in  this  way  are  covariants  of  a  special  sort,  since  the 
coordinates  of  the  two  points  occur  only  in  the  combinations  (1). 

As  an  example,  let  us  consider  four  points 

(^i,  ^t,  ^i-,  ti)  {i  =  1,  2,  3,  4). 

The  determinant  of  these  sixteen  coordinates  is,  by  Theorem  3,  §  31, 
a  CO  variant  of  weight  —  1.  Let  us  denote  by  plj  and  p'^j  the  coordi- 
nates of  the  lines  determined  by  the  first  two  and  the  last  two  points 
respectively.  Expanding  the  four-rowed  determinant  just  referred 
to  by  Laplace's  method  according  to  the  two-rowed  determinants  of 
the  first  two  rows,  we  get 

(^)  PnPsi  +  Pl2PBi  +  PlsPi2  +  Pl3P^2  +  PuP2Z  +  ^14^23* 

This,  then,  is  an  expression  having  the  invariant  property  and  in- 
volving only  line-coordinates. 

Since  the  vanishing  of  the  four-rowed  determinant  from  which 
we  started  gave  the  condition  that  the  four  points  lie  in  a  plane,  it 
follows  that  the  vanishing  of  (4)  gives  a  necessary  and  sufficient 
condition  that  the  two  lines  p^  and  p'^  lie  in  a  plane,  or,  what 
amounts  to  the  same  thing,  that  they  meet  in  a  point. 

*  By  a  slight  modification  of  the  formulae,  this  proof  will  apply  to  the  case  in 
which  any  one  of  the  j9's  is  assumed  different  from  zero. 
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EXERCISES 

1.  Prove  that,  if  the  point-coordinates  are  subjected  to  the  linear  transfor- 
mation f  ,  ,  ,  /■'      t    a    o    A\ 

the  line-coordinates  will  be  subjected  to  the  linear  transformation 

+  {Ci^Cj2  —  Ci2Cj4)p42  +  iCi2Cj3  -  C,-3Cy2) />23- 

2.  A  plane  is  determined  by  three  points 

(yi,  2/2,  1/3,  2/4),  (2:1,  22,  Z3,  Z^),  (Wi,  W2,  W3,  W4). 

Prove  that  the  three-rowed  determinants  of  the  matrix  of  these  three  points  may 
be  used  as  coordinates  of  this  plane,  and  that  these  coordinates  are  not  distinct 
from  the  plane-coordinates  defined  in  §  34. 

3.  A  line  determined  by  two  of  its  points  may  be  called  a  ray,  and  the  line- 
coordinates  of  the  present  section  may  therefore  be  called  ray-coordinates.  A  line 
determined  as  the  intersection  of  two  planes  may  be  called  an  axis.  If  (wi,  u^, 
Us,  M4)  and  (vi,  V2,  V3,  V4)  are  two  planes  given  by  their  plane-coordinates,  discuss 
the  axis-coordinates  of  their  intersection, 

g'12,     ^13,     <?14,     5^34,     ^42,     (?23j 

where  qij  —  UiVj  —  iijVi. 

4.  Prove  that  ray-coordinates  and  axis-coordinates  are  not  essentially  differ- 
ent by  showing  that,  for  any  line,  the  ^'s,  taken  in  the  order  written  in  Exercise  3, 
are  proportional  to  the  jo's  taken  in  the  order 

PS^J    Pi2,    P23y    P12,    Pl3,    Pw 

5.  A  point  is  determined  as  the  intersection  of  three  planes 

(Mi,  M2,  U3,  W4),  (Vl,  V2,  V3,  Vi),  (Wl,  ^^2,  «^3,  W^). 

Prove  that  the  three-rowed  determinants  of  the  matrix  of  these  planes  may  be 
used  as  coordinates  of  this  point,  and  that  they  do  not  differ  from  the  ordinary 
point-coordin  ates. 

Hence,  show  that  all  covariants  may  be  regarded  as  invariants. 


CHAPTER  VIII 


BILINEAR  FORMS 

36.  The  Algebraic  Theory.  Before  entering  on  the  study  of 
quadratic  forms,  whicli  will  form  the  subject  of  the  next  five  chapters, 
we  turn  briefly  to  a  very  special  type  of  quadratic  form  in  2n  varia- 
bles, known  as  a  bilinear  form,  and  which,  as  its  name  implies,  forms 
a  natural  transition  between  linear  and  quadratic  forms. 

Definition  1.  A  polynomial  in  the  2n  variables  {xp  .••  x^)', 
il/v  ' ' '  Vn)  ^'^  called  a  bilinear  form  if  each  of  its  terms  is  of  the  first 
degree  in  the  x's  and  also  of  the  first  degree  in  the  y's. 

Thus,  for  71  =  3,  the  most  general  bilinear  form  is 

+  ^21'^2^1  ~^  ^22^2^2  "'"  ^23^2^3 

+  ^3i%yi  +  ^32^3^2  "'"  ^33^3^3' 

3 

This  may  be  denoted,  for  brevity,  by  ^a^p^yj ;  and,  in  general,  we 
may  denote  the  bilinear  form  in  2  ^  variables  by 


(1) 

The  matrix 


2.aijXjyj. 


a  = 


a 


11 


a 


in 


a 


n\ 


a. 


is  called  the  matrix  of  the  form  (1) ;  its  determinant,  the  determinant 
of  the  form ;  and  its  rank,  the  rank  of  the  form.*  A  bilinear  form 
is  called  singular  when,  and  only  when,  its  determinant  is  zero. 

*  It  should  be  noticed  that  the  bilinear  form  is  completely  determined  when  its 
matrix  is  given,  so  there  will  be  no  confusion  if  we  speak  of  the  bilinear  form  a.  If 
two  bilinear  forms  have  matrices  ai  and  a2,  their  sum  has  the  matrix  ai  +  a2-  The 
bilinear  form  whose  matrix  is  aiag  is  not  the  product  of  the  two  forms,  but  is  sometimes 
spoken  of  as  their  symbolic  product. 

114 
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Let  us  notice  that  the  bilinear  form  (1)  may  be  obtained  by 
starting  from  the  system  of  n  linear  forms  in  the  y's  of  matrix  a, 
multiplying  them  respectively  by  a:^,  a^g,  •••  x^^  and  adding  them 
together.  It  can  also  be  obtained  by  starting  from  the  system 
of  n  linear  forms  in  the  x^  whose  matrix  is  the  conjugate  of  a, 
multiplying  them  respectively  by  y^,  y^,  •••  y„,  and  adding  them 
together. 

Using  the  first  of  these  two  methods,  we  see  (cf.  Theorem  1,  §  31) 
that  if  the  y's  are  subjected  to  a  linear  transformation  with  matrix 
d,  the  bilinear  form  is  carried  over  into  a  new  bilinear  form  whose 
matrix  is  ad.  Using  the  second  of  the  above  methods  of  building 
up  the  bilinear  form  from  linear  forms,  we  see  that  if  the  x's  are 
subjected  to  a  linear  transformation  with  matrix  c,  we  get  a  new 
bilinear  form  the  conjugate  of  whose  matrix  is  a'c,  where  accents 
are  used  to  denote  conjugate  matrices.  The  matrix  of  the  form 
itself  is  then  (cf.  Theorem  6,  §  22)  c'a.* 

Combining  these  two  facts,  we  have 

Theorem  1.  If,  in  the  hilinear  form  (1)  with  matrix  a,  we  subject 
the  x's  to  a  linear  transformation  with  matrix  c  and  the  y's  to  a  linear 
transformation  with  matrix  d,  we  obtain  a  new  bilinear  form  with  matrix 
c'ad,  where  d  is  the  conjugate  of  c. 

Considering  the  determinants  of  these  matrices,  we  may  say : 

Theorem  2.  The  determinant  of  a  bilinear  form  is  multiplied  by 
the  product  of  the  determinants  of  the  transformations  to  which  the  x*s 
and  y's  are  subjected.-f 

We  also  infer  from  Theorem  1,  in  combination  with  Theorem  7, 
§  25,  the  important  result :  J 

Theorem  3.  The  rank  of  a  bilinear  form  is  an  invariant  with  re- 
gard to  non- singular  linear  transformations  of  the  x's  and  y's. 

Definition  2.  A  bilinear  form  whose  matrix  is  symmetric  is 
called  a  symmetric  bilinear  form. 

*  These  results  may  also  be  readily  verified  without  referring  to  any  earlier  theorems. 

t  This  theorem  tells  us  that  the  determinant  of  a  bilinear  form  is,  in  a  generalized 
sense,  a  relative  invariant.  Such  invariants,  where  the  given  forms  depend  on  several 
sets  of  variables,  are  known  as  combinants. 

t  This  result  may  also  be  deduced  from  Theorem  2,  §  30. 
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Theorem  4.  A  symmetric  bilinear  form  remains  symmetric  if  we 
subject  the  xs  and  the  y's  to  the  same  linear  transformation. 

For  if  c  is  the  matrix  of  the  transformation  to  which  both  the  a;'s 
and  the  ^'s  are  subjected,  the  matrix  of  the  transformed  form  will, 
by  Theorem  1,  be  c^ac.  Remembering  that  a,  being  symmetric,  is 
its  own  conjugate,  we  see,  by  Theorem  6,  §  22,  that  c'ac  is  its  own 
conjugate.     Hence  the  transformed  form  is  symmetric. 

EXERCISES 

1.  Prove  that  a  necessary  and  sufficient  condition  for  the  equivalence  of  two 
bilinear  forms  with  regard  to  non-singular  linear  transformations  of  the  x's  and 
^*s  is  that  they  have  the  same  rank. 

2.  Prove  that  a  necessary  and  sufficient  condition  that  it  be  possible  to  factor 
a  bilinear  form  into  the  product  of  two  linear  forms  is  that  its  rank  be  zero  or  one. 

3.  Prove  that  every  bilinear  form  of  rank  r  can  be  reduced  by  non-singular 
linear  transformations  of  the  a;'s  and  y's  to  the  normal  form 

xiyi  +  ^2y2+  •••  -\-Xryr' 

4.  Do  the  statements  in  the  preceding  exercises  remain  correct  if  we  confine 
our  attention  to  real  bilinear  forms  and  real  linear  transformations? 

5.  Prove  that  a  necessary  and  sufficient  condition  that  the  form 

Xiyi  +  X2y2+  •••  ^-Xnyn 

should  be  unchanged  by  linear  transformations  of  the  ar's  and  of  the  ^'s  is  that 
these  be  contragredient  transformations. 

37.  A   Geometric  Application.      Let  {x^,  a; 2,  ^^3)  and  («/i,  3/2?  Vz)  ^^ 

homogeneous  coordinates  of  points  in  a  plane,  and  let  us  consider 

the  bilinear  equation 

3 

(1)  l.aijXiyj  =  0, 

If  (3/j,  y^,  y^)  is  a  fixed  point  P,  then  (1),  being  linear  in  the  x's^ 
is  the  equation  of  a  straight  line  p.  The  only  exception  is  when  the 
coefficients  of  (1),  regarded  as  a  linear  equation  in  the  a;'s,  are  all 
zero,  and  this  cannot  happen  if  the  determinant  of  the  form  is  differ- 
ent from  zero.  Thus  we  see  that  the  equation  (1)  causes  one,  and 
only  one,  line  p  to  correspond  to  every  point  P  of  the  plane,  pro- 
vided the  bilinear  form  in  (1)  is  non-singular. 

Conversely,  if 

(2)  Ax^  +  Bx^  4-  O'ajg  =  0 
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is  a  line  jt?,  there  is  one,  and  only  one,  point  P  which  corresponds  to  it 
by  means  of  (1),  provided  the  bilinear  form  in  (1)  is  non-singular.  For 
if  P  is  the  point  (y^,  y^,  ^g),  the  equation  of  the  line  corresponding  to  it 
is  (1),  and  the  necessary  and  sufficient  condition  that  this  line  coincide 
with  (2)  is  a^^y^  +  a^^y^  +  a^^^  =  pA, 

HlVl  +  ^22^2  +  ^23^3  =  9^'> 
H\y\  +  ^83^2  +  ^33^3  =9^'> 

where  p  is  a  constant,  not  zero.  For  a  given  value  of  /o,  this  set  of 
equations  has  one,  and  only  one,  solution  (y^,  y^^  3/3),  since  the  deter- 
minant a  is  not  zero,  while  a  change  in  p  merely  changes  all  the  i/'s 
in  the  same  ratio.     Hence, 

Theorem.    If  the  bilinear  equation  (1)  is  non-singular^  it  establishes 
a  one-to-one  correspondence  between  the  points  and  lines  of  the  plane. 
This  correspondence  is  called  a  correlation, 

EXERCISES 

1.  Discuss  the  singular  correlations  of  the  plane,  considering  separately  the 
cases  in  which  the  rank  of  the  bilinear  form  is  2  and  1. 

2.  Examine  the  corresponding  equation  in  three  dimensions,  that  is,  the  equa- 
tion obtained  by  equating  to  zero  a  bilinear  form  in  which  n  =  4,  and  discuss  it  for 
all  possible  suppositions  as  to  the  rank  of  the  form. 

3.  Show  that  a  necessary  and  sufficient  condition  for  three  or  more  lines, 
which  correspond  to  three  or  more  given  points  by  means  of  a  non-singular  corre- 
lation, to  be  concurrent  is  that  the  points  be  collinear. 

4.  Show  that  the  cross-ratio  of  any  four  concurrent  lines  is  the  same  as  that  of 
the  four  points  to  which  they  correspond  by  means  of  a  non-singular  correlation. 

5.  Let  P  be  any  point  in  a  plane  and  p  the  line  corresponding  to  it  by  means 
of  a  non-singular  correlation.  Prove  that  a  necessary  and  sufficient  condition  for 
the  lines  corresponding  to  the  points  of  p  to  pass  through  P  is  that  the  bilinear 
form  be  symmetrical. 

6.  State  and  prove  the  corresponding  theorem  for  points  and  planes  in  space 
of  three  dimensions,  showing  that  here  it  is  necessary  and  sufficient  that  the  form 
be  symmetrical  or  skew-symmetrical.* 

*  The  correlation  given  by  a  symmetric  bilinear  equation  is  known  as  a  reciproca- 
tion. By  reference  to  the  formulae  of  the  next  chapter,  it  will  be  readily  seen  that  in 
this  case  every  point  corresponds,  in  the  plane,  to  its  polar  with  regard  to  a  fixed  conic  ; 
in  space,  to  its  polar  plane  with  regard  to  a  fixed  quadric  surface.  The  skew-symmetric 
bilinear  equation  gives  rise  in  the  plane  merely  to  a  very  special  singular  correlation. 
In  space,  however,  it  gives  an  important  correlation  which  is  in  general  non-singular 
and  is  known  as  a  null-system.  Cf.  any  treatment  of  line  geometry,  where,  however, 
the  subject  is  usually  approached  from  another  side. 


CHAPTER  IX 

GEOMETRIC   INTRODUCTION   TO   THE  STUDY   OF  QUADRATIC 

FORMS 

38.  Quadric  Surfaces  and  their  Tangent  Lines  and  Planes.  If 
a^j,  x^,  x^  are  homogeneous  coordinates  in  a  plane,  we  see,  by  reference 
to  §  4,  that  the  equation  of  any  conic  may  be  written 

Similarly,  in  space  of  three  dimensions,  the  equation  of  any  quadric 
surface  may  be  written 

This  form  may  be  made  still  more  symmetrical  if,  besides  the 
coefficients  a^g,  <^i3,  ^14,  ^534,  ^3^42'  ^^23'  ^®  introduce  the  six  other  con- 
stants ^21,  <^3i,  ^41,  <^43,  <^24'  %2'  defined  by  the  general  formula 

The  equation  of  the  quadric  surface  may  then  be  written 

CZ-1-iJ^-i  I*    Ct-tn^-iXn    "1       Ct-toX-lXo    "1"    CI  •*  AUy-i*C  A 

"^~   Cl/n-tXnX-i    ■"]"    (AjcynJL/n  ~\~   {XnnJbnJbn    ~]~    Ctn^«/-'0'^A 

"^  (XrytXnX-t    ~|~   (XncyXrpUty   "y"   6t'qq«*^q  "l       (^QaXoXa 

I    <^4j^4^i  ~T~  '^42'^4'^2  "■"  ^43'^4'^3  "■"  ^44*^4      ^  ^5 

or  for  greater  brevity 

(1)  2  a^jXiXj  =  0. 


Definition  1.  —  The  matrix  of  the  sixteen  a^s  taken  in  the  order 
written  above  is  called  the  matrix  of  the  quadric  surface  (1),  the  deter- 
minant of  this  matrix  is  called  the  discriminant  of  the  quadric  surface^ 
its  rank  is  called  the  rank  of  the  quadric  surface^  and  if  the  discrimi- 
nant vanishes^  the  quadric  surface  is  said  to  he  singular. 

iia 
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A  fundamental  problem  is  the  following:  If  (y^,  y^^  yg,  y^  and 
(zy,  z^^  ^3,  z^  are  two  points,  in  what  points  does  the  line  yz  meet  the 
surface  (1)? 

The  coordinates  of  any  point  on  yz^  other  than  y,  may  be  written 

A  necessary  and  sufficient  condition  for  this  point  to  lie  on  (1)  is 

4 

^aij{z,  +  \y,)  {Zj.  +  \yj)  =  0, 

or  expanded, 

(2)  ^%-ZiZj  +  2  X^aijyiZj  +  X'^la^iyj  =  0. 

If  the  point  y  does  not  lie  on  (1),  this  is  a  quadratic  equation  in  X. 
To  each  root  of  this  equation  corresponds  one  point  where  the  line 
meets  the  quadric.  Thus  we  see  that  every  line  through  a  point  y 
which  does  not  lie  on  a  quadric  surface,  meets  this  surface  either  in 
two,  and  only  two,  distinct  points,  or  in  only  one  point. 

On  the  other  hand,  if  y  does  lie  on  (1),  the  equation  (2)  reduces  to 
an  equation  of  the  first  degree,  provided  ^a^jy^Zj  ^  0.  In  this  case, 
also,  the  line  meets  the  surface  in  two,  and  only  two,  distinct  points, 
viz.,  the  point  y  and  the  point  corresponding  to  the  root  of  the  equa- 
tion of  the  first  degree  (2). 

Finally,  if  ^a^jy^yj  =  ^a^jyiZj  =  0,  the  first  member  of  equation 
(2)  reduces  to  a  constant,  so  that  (2)  is  either  satisfied  by  no  value 
of  X,  in  which  case  the  line  meets  the  surface  at  the  point  y  only, 
or  by  all  values  of  X  (if  ^a^jZiZj  =  0),  in  which  case  every  point  on  the 
line  is  also  a  point  on  the  surface. 

Combining  the  preceding  results  we  may  say: 

Theorem  1.  If  a  quadric  surface  and  a  straight  line  are  given^ 
one  of  the  following  three  cases  must  occur  : 

(1)  The  line  meets  the  quadric  in  two^  and  only  two^  points^  in  which 
case  the  line  is  called  a  secant, 

(2)  The  line  meets  the  quadric  in  one^  and  only  one,  point,  in  which 
case  it  is  called  a  tangent.* 

(3)  Every  point  of  the  line  is  a  point  of  the  quadric.  In  this  case 
the  line  is  called  a  ruling  of  the  quadric.  f 

*  We  shall  presently  distinguish  between  true  tangents  and  pseudo-tangents. 
t  Also  called  a  generator,  because,  as  will  presently  appear,  the  whole  surface  may 
be  generated  by  the  motion  of  such  a  line. 
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That  all  these  three  cases  are  possible  is  shown  by  simple  exam- 
ples ;  for  instance,  in  the  case  of  the  surface 

the  three  coordinate  axes  illustrate  the  three  cases. 

We  shall  often  find  it  convenient  to  say  that  a  tangent  line  meets 
the  quadric  in  two  coincident  points. 

From  the  proof  we  have  given  of  Theorem  1,  we  can  also  infer 
the  further  result : 

Theorem  2.     If  (y^,  y^^  y^^  y^  is  a  point  on  the  quadric  (1),  then  if 

4 

(3)  2:aiyiri^y=0,* 

every  line  through  y  is  either  a  tangent  or  a  ruling  q/  (1),  otherwise 
every  line  through  y  which  lies  in  the  plane 

4 

(4)  ^ctijXi^j  =  0 

is  a  tangent  or  ruling  of  (1),  while  every  other  line  through  y  is  a  secant. 

A  theorem  of  fundamental  importance,  which  follows  immediately 
from  this,  is : 

Theorem  3.  If  there  exists  a  point  y  on  the  quadric  (1)  such  that 
the  identity  (3)  is  fulfilled.,  then  (1)  is  a  cone  with  y  as  a  vertex  ;  and^  con- 
versely., if  {l)is  a  cone  with  y  as  a  vertex,  then  the  identity  (3)  is  fulfilled. 

We  pass  now  to  the  subject  of  tangent  planes,  which  we  define 
as  follows : 

Definition  2.  A  plane  p  is  said  to  be  tangent  to  the  quadric  (1) 
at  one  of  its  points  P,  if  every  line  of  p  which  passes  through  P  is  either  a 
tangent  or  a  ruling  of  (V). 

It  will  be  seen  that,  according  to  this  definition,  if  (1)  is  a  cone,  every 
plane  through  a  vertex  of  (1)  is  tangent  to  (1)  at  this  vertex.  We  have 
thus  included  among  the  tangent  planes,  planes  which  in  ordinary 
geometric  parlance  would  not  be  called  tangent.  The  same  objection 
applies  to  our  definition  of  tangent  lines.  We  therefore  now  intro- 
duce the  distinction  between  true  tangent  lines  or  planes  and  pseudo- 
tangent  lines  or  planes. 

Definition  3.  A  line  or  plane  which  touches  a  quadric  surface  at 
a  point  which  is  not  a  vertex  is  called  a  true  tangent ;  all  other  tangent 
lines  and  planes  are  called  pseudo-tangents. 

*  It  should  be  noticed  that,  on  account  of  the  relation  aij  =  %»,  'SaijXiyj  =  'LaijyiXj. 
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EXERCISES 

1.  Prove  that  if  P  is  a  point  on  a  quadric  surface  S,  which  is  not  a  vertex, 
and  p  the  tangent  plane  at  this  point,  one  of  the  following  three  cases  must  occur  : 

(a)    Two,  and  only  two,  lines  of  p  are  rulings  of  S,  and  these  rulings  intersect 

at  P. 

(6)  One,  and  only  one,  line  of  jo  is  a  ruling  of  5,  and  this  ruling  passes 
.through  P. 

(c)    Every  line  of  p  is  a  ruling  of  S. 

2.  Prove  that 

(a)  When  case  (a)  of  Exercise  1  occurs,  the  quadric  surface  is  not  a  cone ; 
and,  conversely,  if  the  quadric  surface  is  not  a  cone,  case  (a)  will  always  occur. 

(b)  If  case  (b)  of  Exercise  1  occurs,  p  is  tangent  to  S  at  every  point  of  the 
ruling  which  lies  in  p. 

(c)  If  case  (b)  of  Exercise  1  occurs,  »S  is  a  cone  with  one,  and  only  one,  vertex, 
and  this  vertex  is  on  the  ruling  which  lies  in  p ;  and  conversely,  if  5  is  a  cone  with 
one,  and  only  one,  vertex,  case  (b)  will  always  occur. 

(d)  If  case  (c)  of  Exercise  1  occurs,  there  is  a  line  Z  in  jt>  every  point  of  which 
(but  no  other  point)  is  a  vertex  of  S;  and  ^S"  consists  of  two  planes  one  of  which 
is  p,  while  the  other  intersects  it  in  Z. 

39.  Conjugate  Points  and  Polar  Planes.  Two  points  are  com- 
monly said  to  be  conjugate  with  regard  to  a  quadric  surface 

4 

(1)  ^^ijXiXj  =  0, 

when  they  are  divided  harmonically  by  the  points  where  the  line 
connecting  them  meets  the  surface.  In  order  to  include  all  limiting 
cases,  we  frame  the  definition  as  follows: 

Definition.  Two  distinct  points  are  said  to  he  conjugate  with  re- 
gard, to  the  surface  {V)  if 

(a)  The  line  joining  them  is  a  tangent  or  a  secant  to  (1),  and  the 
points  are  divided  harmonically  hy  the  points  where  this  line  meets  (1)  ;  or 

(b)  The  line  joining  them  is  a  ruling  of  (1), 

Two  coincident  points  are  called  conjugate  if  they  both  lie  on  (1). 

Let  the  coordinates  of  the  points  be  {y^^  y^-,  ^3,  y^)  and  (z^,  z^,  Sg,  z^), 
and  let  us  look  first  at  the  case  in  which  the  points  are  distinct  and 
neither  of  them  lies  on  (1),  and  in  which  the  line  connecting  them  is 
a  secant  of  (1).  The  points  of  intersection  of  the  line  yz  with  (1) 
may  therefore  be  written 

(^1  +  \yi,  z^  -f  \y2^  23  +  \,  ?/3,  24  +  \,  y^)  ({ =  1,  2), 


122  INTRODUCTION   TO   HIGHER  ALGEBRA 

where  \  and  Xg  are  the  roots  of  Equation  (2),  §  38.  A  necessary 
and  sufficient  condition  for  harmonic  division  is  that  the  cross-ratio 
Xj  /  ^2  l^^ve  the  value  —  1  ;  that  is  X ^  +  Xg  =  ^  5  ^^^  referring  back 
to  Equation  (2),  §  38, 


4 


(2)  2a,y^,2,.  =0. 

We  leave  it  for  the  reader  to  show  that  in  all  other  cases  in  which  ^ 
and  z  are  conjugate  this  relation  (2)  is  fulfilled  ;  and  that,  conversely, 
whenever  this  condition  is  fulfilled,  the  points  are  conjugate.    That  is : 

Theorem  1.  A  necessary/  and  sufficient  condition  that  the  points 
y,  z  be  conjugate  with  regard  to  (1)  is  that  (2)  he  fulfilled. 

This  theorem  enables  us  at  once  to  write  down  the  equation  of 
the  locus  of  the  point  x  conjugate  to  a  fixed  point  y,  namely, 

4 

(3)  Ijii^x^yj^O, 

Except  when  the  first  member  of  this  equation  vanishes  identically, 
this  locus  is  therefore  a  plane  called  the  polar  plane  of  the  point  y. 
We  saw  in  the  last  section  that  the  first  member  of  (3)  vanishes 
identically  when  (1)  is  a  cone  and  y  is  a  vertex.  This  is  the  only 
case  in  which  it  vanishes  identically  ;  for,  if  y  is  any  point,  not  a 
vertex,  on  a  quadric  surface,  (3)  represents  the  tangent  plane  at  that 
point;  while  if  y  is  not  on  (1),  the  first  member  of  (3)  can  clearly  not 
vanish  identically,  since  it  does  not  vanish  when  the  a;'s  are  replaced 
by  the  i/'s.     Hence  the  theorem  : 

Theorem  2.  If  (1)  is  not  a  cone^  every  point  y  has  a  definite  polar 
plane  (3)  ;  if  (\)  is  a  cone,  every  point  except  its  vertices  has  a  definite 
polar  plane  (3),  while  for  the  vertices  the  first  member  o/"  (3)  is  identi- 
cally zero. 

We  note  that  the  property  that  a  plane  is  the  polar  of  a  given 
point  with  regard  to  a  quadric  surface  is  a  projective  property,  since 
a  collineation  of  space  evidently  carries  over  two  conjugate  points 
into  points  conjugate  with  regard  to  the  transformed  surface. 

Theorem  3.  If  two  points  P^  and  P^  are  so  situated  that  the 
polar  plane  of  P^  passes  through  P^-,  then,  conversely,  the  polar  plane 
of  Pg  ^^^^  pass  through  P^. 
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For,  from  the  hypothesis,  it  follows  that  P^  and  P^  are  conjugate 
points,  and  from  this  the  conclusion  follows. 

40.  Classification  of  Quadric  Surfaces  by  Means  of  their  Rank. 

Theorem  2  of  the  last  section  may  be  stated  by  saying  that  a  neces- 
sary and  sufficient  condition  that  the  quadric  surface 

(1)  ^dijXiXj  =  0 

be  a  cone  and  that  (^j,  1/2^  y^^  y^  be  its  vertex  (or  one  of  its  vertices) 
is  that 

(2)  2a,;^.^,.y,.  =  0. 

This  identity  (2)  is  equivalent  to  the  four  equations 


(3) 


«21^1  +  ^22^2  +  H^VZ  +  ^24^4  =  ^' 
^81^1  +  ^32^/2  +  ^33^3+  ^34^4  =  ^' 
^41^1  +  ^42^2  +  ^43^3+  ^44^4  =  ^• 


A  necessary  and  sufficient  condition  for  this  set  of  equations  to 
have  a  common  solution  other  than  (0,  0,  0,  0)  is  that  the  determi- 
nant of  their  coefficients  be  zero.  We  notice  that  this  determinant 
i3  the  discriminant  a  of  the  quadric  surface.     Hence, 

Theorem  1.  A  necessary  and  sufficient  condition  for  a  quadric 
surface  to  he  a  cone  is  that  its  discriminant  vanish. 

If,  then,  the  rank  of  the  quadric  surface  is  four^  the  surface  is  not 
a  cone. 

If  the  rank  is  three^  the  set  of  equations  (3)  has  one,  and,  except 
for  multiples  of  this,  only  one,  solution.  Hence  in  this  case  the  sur- 
face is  an  ordinary  cone  with  a  single  vertex. 

If  the  rank  is  two^  equations  (3)  have  two  linearly  independent  solu- 
tions (cf.  §18),  on  which  all  other  solutions  are  linearly  dependent. 
Hence  in  this  case  the  surface  is  a  cone  with  a  whole  line  of  vertices. 

If  the  rank  is  one.,  equations  (3)  have  three  linearly  independent 
solutions  on  which  all  other  solutions  are  linearly  dependent.  Hence 
we  have  a  cone  with  a  whole  plane  of  vertices. 

If  the  rank  is  zero  we  have,  strictly  speaking,  no  quadric  surface ; 
but  the  locus  of  (1)  may  be  regarded  as  a  cone,  every  point  in  space 
being  a  vertex. 
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It  is  clear  that  the  property  of  a  quadric  surface  being  a  cone  is 
a  projective  property  ;  and  the  same  is  true  of  the  property  of  a  point 
being  a  vertex  of  a  cone.  Hence  from  the  classification  we  have 
just  given  we  infer 

Theorem  2.  The  rank  of  a  quadric  surface  is  unchanged  hy  non- 
singular  coUineations, 

EXERCISES 

1.  Definition.  If  a  plane  p  is  the  polar  of  a  point  P  with  regard  to  a  quadric 
surface,  then  P  is  called  a  pole  of  p. 

Prove  that  if  the  quadric  surface  is  non-singular,  every  plane  has  one,  and  only 
one,  pole. 

2.  Prove  that  if  the  quadric  surface  is  a  cone,  a  plane  which  does  not  pass 
through  a  vertex  has  no  pole. 

What  can  be  said  here  about  planes  which  do  pass  through  a  vertex  ? 

41.  Reduction  of  the  Equation  of  a  Quadric  Surface  to  a  Normal 
Form.  Since  cross-ratio  is  invariant  under  a  non-singular  coUinea- 
tion,  a  quadric  surface  aS',  a  point  P,  not  on  S,  and  its  polar  plane 
with  regard  to  S,  are  carried  over  by  any  non-singular  collineation 
into  a  quadric  surface  S'^  a  point  P',  and  its  polar  plane  with  regard 

4 

to  S'.     A  point  (?/i,  ^2^  I/b")  V^  ^^^  on  the  quadric  surface  ^a^jX^Xj-  =  0, 

4  ^ 

cannot  be  on  its  own  polar  plane  ^a^jX^yj  =  0  as  we  see  by  replacing 

the  rr's  in  this  last  equation  by  the  y's.  Now  transform  by  a  colline- 
ation so  that  this  point  becomes  the  origin  and  its  polar  plane  the 
plane  at  infinity.*  The  quadric  surface  will  now  be  a  central  quad- 
ric with  center  at  the  origin,  since,  if  any  line  be  drawn  through  the 
origin,  the  two  points  in  which  this  line  meets  the  surface  are  divided 
harmonically  by  the  origin  and  the  point  at  infinity  on  this  line. 

The  equation  of  the  polar  plane  of  the  point  (3/1,  ^31  yz->  ^4)  with 
regard  to  the  transformed  quadric 

4 

^a^jX^Xj  —  v 

4 
is  la'ijx'iyj  —  0, 

*  Such  a  collineation  can  obviously  be  determined  in  an  infinite  number  of  ways 
by  means  of  the  theorem  that  there  exists  a  collineation  which  carries  over  any  five 
linearly  independent  points  into  any  five  linearly  independent  points ;  cf.  Exercises  2, 
3,  §  24. 
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which  reduces  to  the  simple  form 

when  the  point  is  the  origin  (0,  0,  0,  1).     For  this  equation  to  rep- 
resent the  plane  at  infinity,  we  must  have 

^14  =  ^24  =  ^4  =  ^'  ^44  ^^' 

Hence  the  quadric  surface  becomes 

+  <^21  "^2  *^1  "■    ^22  *^2  "r"  ^23  *^2  '^  3 

+  6lg2  ^3  ^'i  H~  ^32  *^3  -^2    '    ^33  *^3"' 

A  slightly  different  reduction  can  be  performed  by  transforming 
the  point  (y^,  ^/g,  ^^3,  y^)  to  the  point  at  infinity  on  the  rr^-axis  and  its 
polar  plane  to  the  x^x^-iplsine.  It  is  easy  to  see  that  we  thus  get  rid 
of  the  terms  containing  x^  except  the  square  term. 

Similarly  we-  can  get  rid  of  the  terms  containing  rrg  and  x^. 
Thus  we  see  that  an^  quadric  surface  can  be  reduced  by  a  collineation 
to  a  form  where  its  equation  contains  no  term  in  x^  except  the  term  in 
xf  whose  coefficient  then  is  not  zero. 

According  as  we  take  for  i  the  values  1,  2,  3,  4,  we  get  thus  four 
different  normal  forms  for  the  equation  of  our  quadric  surface,  and 
inasmuch  as  each  of  these  forms  can  be  obtained  in  a  great  variety 
of  ways,  the  question  naturally  arises  whether  we  cannot  perform 
all  four  reductions  simultaneously.  That  this  can,  in  general,  be 
done  may  be  seen  as  follows :  let  y  be  a  point  not  on  the  quadric 
surface,  and  z  aiiy  point  on  the  polar  plane  of  «/,  but  not  on  the 
quadric  surface.  Its  polar  plane  contains  y.  Let  w  be  any  point  on 
the  intersection  of  the  polar  planes  of  y  and  2,  but  not  on  the  quadric 
surface.  Then  its  polar  plane  passes  through  1/  and  z.  These  three 
polar  planes  meet  in  some  point  w,  and  it  is  readily  seen  that  the  four 
points  ?/,  2,  w^  u  do  not  lie  on  a  plane.  The  tetrahedron  yzwu  is  called 
a  polar  or  self  conjugate  tetrahedron  of  the  quadric  surface,  since  it 
has  the  property  that  any  vertex  is  the  pole  of  the  opposite  face. 

If  we  transform  the  four  points,  y,  2,  w^  u  to  the  origin  and  the 
points  at  infinity  on  the  three  axes,  the  effect  will  be  the  same  as  that 
of  the  separate  transformations  above,  that  is,  the  equation  of  the 
quadric  surface  will  be  reduced  to  the  form 

^11  *^l     '    ^22  '^2    '•"  ^33  *^3     '    ^44  "^4  ^ 
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We  have  tacitly  assumed  that  it  is  possible  to  find  points  ?/,  2?,  w^ 
constructed  as  indicated  above,  and  not  lying  on  the  quadric  surface. 
We  leave  it  for  the  reader  to  show  that,  if  the  quadric  surface  is  not 
a  cone,  this  will  always  be  possible  in  an  infinite  number  of  ways. 
A  cone,  however,  has  no  self-conjugate  tetrahedron,  and  in  this  case 
the  above  reduction  is  impossible. 

EXERCISES 

1.  Prove  that  if  the  discriminant  of  a  quadric  surface  is  zero,  the  equation  of 
the  surface  can  always  be  reduced,  by  a  suitable  collineation,  to  a  form  in  which 
the  coordinate  x^  does  not  enter. 

[Suggestion.  Show,  by  using  the  results  of  this  chapter,  that  if  the  vertex  of  a 
quadric  cone  is  at  the  origin,  au  =  ^24  =  «34  =  «44  =  0.] 

2.  Show  that,  provided  the  cone  has  a  finite  vertex,  the  collineation  of 
Exercise  1  may  be  taken  in  the  form 

X2  =  ^2  +  p  X^, 
Xg  —  X<j  -\-  "Y  x^, 

x^  =  x^. 
[Suggestion.     Use  non-homogeneous  coordinates.] 


CHAPTER   X 

QUADRATIC   FORMS 

42.  The  General  Quadratic  Form   and  its    Polar.      The   general 
quadratic  form  in  n  variables  is 

n 
{jLy  ^ddijX^Xj  =  Ct-^^X^       -f-  ^j2^i*^2  "i~  •  •  •  "I    ^\n-^\-^n 

"T"  ^21'^2*^1  "I     ^22*^2  I     •  •  •     i~  ^2w*^2*^« 


"T    ^nl*^n*^l  "T"  ^n2^'n^2    i    •  •  •    "~  ^nrv^n') 


■n 

where  a^  =  a^i.*    The  bilinear  form  ^a^yiZj  is  called  the  polar  form  qf 
(1).     Subjecting  (1)  to  the  linear  transformation 

X^  =  CiiXi   4"  •  •  •  +  ^Irv^n^ 


X^  —  ^n\^l    I     •  •  •     I    Cfirv^rtt 

we  get  a  new  quadratic  form 

n 

The  polar  form  of  (2)  is  ^a[jy[z'j.     If  we  transform  the  2/'s  and  s's  of 
the  polar  form  of  (1)  by  the  same  transformation  c,  we  get  a  new 

n 

bilinear  form  ^aijy'iZj.     We  will  now  prove  that  a^^  =  aj^. 
We  have  the  identities 

n  n 

(4)  '2aijyiZj='La,jy[z^.. 

*  It  should  be  clearly  understood  that  this  restriction  is  a  matter  of  convenience, 
not  of  necessity.  If  it  were  not  made,  the  quadratic  form  would  be  neither  more  nor 
less  general. 
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Each  of  these  we  may  regard  as  identities  in  the  a^^'s,  y^'s,  2"s,  the 
rr's,  ^'s,  z'a  being  merely  abbreviations  for  certain  polynomials  in  the 
corresponding  primed  letters.  The  last  written  identity  reduces, 
when  we  let  ?/J  =  z'i  —  x[  (i  =  1,  2, . . .  n\  to 

n  n  _ 

Combining  this  with  (8)  gives 

n  _  n 


Hence 


%  =  (^ii  and  a,y  +  %i  =  a;y  +  4»- 


We  have  assumed  that  a!^^=.a!^^^  these  being  merely  the  coefficients  of 
a  certain  quadratic  form,  and  we  proved,  in  Theorem  4,  §  36,  that 


a^j  —  a^i.     Hence  we  infer  that  a^  =  a[j. 


From  this  fact  and  from  (4)  we  get  at  once  the  further  result : 

n  n 


That  is  : 

Theorem.    The  polar  form 


^<^i0i^j 


is  an  absolute  covariant  of  the  system  composed  of  the  quadratic  form 

n 

z^a^jX^Xj 
and  the  two  points  (?/i, . . .  y„),  (^j, . . .  2„). 


43.   The  Matrix  and  the  Discriminant  of  a  Quadratic  Form. 

Definition.    The  matrix 

a,-,  ...  a. 


a  = 


11 


Iw 


^nl  '  •  *  ^nn 


is  called  the  matrix  of  the  quadratic  form 


(1) 


^a^jX^Xj. 


The  determinant  of  a  is  called  the  discriminant  of  {!) ;  and  the  rank  of 
a,  the  rank  of  (1).      If  the  discriminant  vanishes,  (1)  is  called  singular. 
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The  matrix  of  (1)  is  the  matrix  of  its  polar  form.  Moreover,  as 
was  shown  in  the  last  section,  if  the  x's  in  (1)  are  subjected  to  a 
linear  transformation,  and  the  ^'s  and  z's  in  the  polar  of  (1)  are  sub- 
jected to  the  same  transformation,  the  matrix  of  the  new  quadratic 
form  will  be  the  same  as  the  matrix  of  the  new  bilinear  form.  But 
we  saw,  in  Theorem  1,  §36,  how  the  matrix  of  a  bilinear  form  is 
changed  by  linear  transformations  of  the  variables.  Thus  we  have 
the  theorem  : 

Theorem  1.  If  in  the  quadratic  form  (1)  with  matrix  a  we  sub- 
ject the  x's  to  a  linear  transformation  with  matrix  c,  we  obtain  a  new 
quadratic  form  with  matrix  c'ac,  where  d  is  the  conjugate  of  c. 

From  this  there  follow  at  once,  precisely  as  in  §  36,  the  further 
results  : 

Theorem  2.  The  rank  of  a  quadratic  form  is  not  changed  by  non- 
singular  linear  transformation. 

Theorem  3.  The  discriminant  of  a  quadratic  form  is  a  relative 
invariant  of  weight  two, 

44.   Vertices  of  Quadratic  Forms. 

Definition.     By  a  vertex  of  the  quadratic  form 

1 

we  understand  a  point  (c^,  •  •  •  a„)  where  the  c^s  are  not  all  zero,  such  that 

n 

(2)  ^a^jXiCj  =  0. 

A  quadratic  form  clearly  vanishes  at  all  of  its  vertices. 

It  is  merely  another  way  of  stating  this  definition  when  we  say : 

Theorem  1.  A  necessary  and  sufficient  condition  that  ((?p  •••  <?„)  be 
a  vertex  of  (V)  is  that  it  be  a  solution,  not  consisting  exclusively  of  zeros, 
of  the  system  of  equations 

(3)  :  :  :  :  :  :  ;  : 

^nl^l  "f"   •  •  *  ~f"  ^nn^n  ==  ^* 
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Since  the  resultant  of  (3)  is  the  discriminant  of  (1),  we  may  add : 

Theorem  2.  A  necessary  and  sufficient  condition  for  a  quadratic 
form  to  have  a  vertex  is  that  its  discriminant  he  zero  ;  and  if  the  rank 
of  the  form  is  r,  it  has  n—r  linearly  independent  vertices^  and  every  point 
linearly  dependent  on  these  is  a  vertex. 

In  particular,  we  note  that  if  the  discriminant  of  a  quadratic 
form  is  zero  and  if  the  cofactors  of  the  elements  of  this  determinant 
are  denoted  in  the  ordinary  way  by  Ai^  then  (^^i,  •  •  •  Ai^  is  a  vertex, 
provided  all  these  A's  are  not  zero. 

The  following  identity  is  of  great  importance  (cf.  formula 
(2),  §  38), 

n  n  n  n 

(4)  ^a^j  {Zi  +  \y>)  {zj  +  \yj)  =  ^a^jZfj  +  2  Xla^  z^yj  +  X^^a^^y^y^. 
This  may  be  regarded  as  an  identity  in  all  the  letters  involved. 

n 

If  (cy,  •••<?„)  is  a  vertex  of  the  quadratic  form  ^a^x^Xj^  and  these 

<?'s  are  substituted  in  (4)  in  place  of  the  ^'s,  the  last  two  terms  of 
the  second  member  of  this  identity  are  zero,  and  we  have 

n  n 

(5)  Say  {Zi  +  \c^  {zj  +  Xcj)  =  S  ayZiZj ; 

and  conversely,  if  (5)  holds,  (<?p  •••  ^„)  is  a  vertex;  for  subtracting 
(5)  from  (4),  after  substituting  the  cs  for  the  y's  in  (4),  we  have 

n  n 

2  X^a^jZiCj  +  X^'Eajj  CiCj  =  0, 
and,  this  being  an  identity  in  X  as  well  as  in  the  z's,  we  have 

n 

^^  a^jZjOj  —~-  \j . 

Thus  we  have  proved  the  following  theorem: 

Theorem  3.  A  necessary  and  sufficient  condition  that  (<?i,  •••<?«) 
he  a  vertex  of  the  quadratic  form  (1)  is  that  Zy,  •  •  •  z^  and  X  heing  in- 
dependent variahles^  the  identity  (5)  he  fulfilled. 

EXERCISES 

1.  Prove  that  if  (c^, . . .  O  is  a  vertex  of  (1),  and  (y^, . . .  yn)  is  any  point  at 
which  the  quadratic  form  vanishes,  then  the  quadratic  form  vanishes  at  every 
point  linearly  dependent  on  c  and  y. 

2.  State  and  prove  a  converse  to  1. 
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45.   Reduction  of  a  Quadratic  Form  to  a  Sum  of  Squares.      If  in 

the  quadratic  form 

(1)  </)  (^1,  •  •  •  ^n)  =  ^(Ifj^i^J 

the  coefficient  a^i  is  not  zero,  we  may  simplify  the  form  by  the  fol- 
lowing transformation  due  to  Lagrange. 
The  difference 

(j>  (x^,  '"X^) (a^^x^  H h  a.i^x^y 

is  evidently  independent  of  Xi.     Denoting  it  by  (^^  we  have 
If,  then,  we  perform  the  non-singular  linear  transformation 

X^  =  Clj^  X-^  -\-  Clj^Xf^  "T"  •  •  *  ~r  ^in^n 
2  —  *^2 


(2) 


*C/*3  ■"■"  *M// 


nf 


the  quadratic  form  </>  is  reduced  to  the  form 

(3)  -xf  +  <f>i{4*-'0, 

in  which  all  the  terms  in  x[  are  wanting  except  the  term  in  x'^. 

It  will  be  seen  that  this  reduction  can  in  general  be  performed  in 
a  variety  of  ways.  It  becomes  impossible  only  when  the  coefficients 
of  all  the  square  terms  in  the  original  quadratic  form  are  zero. 

Unless,  in  the  new  quadratic  form  c^j,  the  coefficients  of  all  the 
square  terms  are  zero,  we  can  apply  the  same  reduction  to  this  form 
by  subjecting  the  variables  x!^^  •••  x^  to  a  suitable  non-singular  linear 
transformation.  This  transformation  may  also  be  regarded  as  a  non- 
singular  linear  transformation  of  all  the  x''^  :  {x[^  o^g,  •••  x'^)  if  we  write 
x1  =  Xy     We  thus  reduce  (3)  to  the  form 

(4)  l<2  +  J_^^/2+^^(^//,...^;;). 

Applying  this  reduction  now  to  c^g^  and  proceeding  as  before,  we 
see  that  by  a  number  of  successive  non-singular  transformations  the 
form  (f>  can  finally  be  reduced  to  the  form  ; 

(5)  c^xj  +  e^xl^  "'-{-c^xl 
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These  successive  transformations  can  now  be  combined  into  a  single 
non-singular  linear  transformation,  and  we  are  thus  led  to  the 

Theorem.     Every  quadratic  form  in  n  variables  can  he  reduced  to 
the  form  (5)  hy  a  non-singular  linear  tra7isformation. 

The  proof  of  this  theorem  is  not  yet  complete  ;  for  if  at  any 
stage  of  the  reduction  the  quadratic  form  <^^  has  the  peculiarity  that 
all  its  square  terms  are  wanting,  the  next  step  in  the  reduction  will  be 
impossible  by  the  method  we  have  used.  Before  considering  this  point, 
we  will  illustrate  the  method  of  reduction  by  a  numerical  case. 
Example. 

2a;f+     x^x^^  +  '^x-^x^' 

=  l{2x^^-x^  +  ^x^Y-^<i>^ 


where 


~\~  Xn  X-i  O    Xn  ~t~    t7  Xn  Xn 

+  8  x^x^  +  9  x^x^  -\-2  x^ 


i      '^  3     2  3 

Accordingly,  by  means  of  the  non-singular  linear  transformation 

'x[  =  2x-^-\-  x^   +8  a^g, 


3' 

the  form  (f)  reduces  to     ^x'^  —  ^x^^  —  ^^^ x'^. 

We  have  given  here  merely  one  method  of  reduction.  Three  differ- 
ent methods  were  open  to  us  at  the  first  step  and  two  at  the  second. 

We  proceed  now  to  complete  the  proof  of  the  general  theorem. 
Let  us  suppose  that  the  coefficients  of  all  the  square  terms  in  <^  are 
zero,*  but  that  a^^^  ^  0.     Then 

^(^1,  "•X^=2  ^12^1^2  +  ^^l  («13^3  + f-  Hn^n) 

+  ^  ^2x^23*^3    i~  *  ■  ■  "I    ^2»*^n  /   I    ^^ij^i^j 

3 

2 

—  V^12*^2  ~^~  ^IZ'^Z  "t"  ' " '  "T  ^in^n)  V^21*^l    •"  ^23'^3  "i~  "  '  "i     ^2»'^n) 

2  ^ 

where      <^i  = (^13^3  H f-  ^m^n)  (^23^3  "I 1"  ^2«^n)  +  ^ctij^iXj. 

Hi  ^ 

*  This  method  may  be  used  whenever  aii  =  a22  =  0  whether  all  the  other  coef- 
ficients an  are  zero  or  not. 
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The  non-singular  linear  transformation 

^2  ^^  ^21*^1  "■"  ^23*^3    I     "  ■  "I    ^2»*^» 


•C'r 


"     •*^«    ^" 


thus  reduces  <j)  to  the  form 


^1  ^2  ''"  T^l\*^3'  *  * '  *^«/* 


12 


The  further  non-singular  transformation 

1  —      1     "^**'2' 


•^2  —     1  2' 


a: 


3' 


2?* 


^« 


reduces  <^  to  the  form 

1 


2.  4^-o^<^+</>i(4^-^:;)- 

^  «12  ^  ^12 

The  above  reduction  was  performed  on  the  supposition  that  ai2=?^0. 
It  is  clear,  however,  that  only  a  slight  change  in  notation  would  be 
necessary  to  carry  through  a  similar  reduction  if  c^i^—  0  but  ciij^  0. 
The  only  case  to  which  the  reduction  does  not  apply  is,  therefore, 
the  one  in  which  all  the  coefficients  of  the  quadratic  form  are  zero, 
a  case  in  which  no  further  reduction  is  necessary  or  possible. 

We  thus  see  that  whenever  Lagrange's  reduction  fails,  the  method 
last  explained  will  apply,  and  thus  our  theorem  is  completely  es- 
tablished. 

EXERCISES 

1.  Given  a  quadratic  form  in  which  n  =  5  and  ay  =  \i  —j\.  Reduce  to  the 
form  (5). 

2.  Reduce  the  quadratic  form 

9  a;2  -  6  2/2  -  8  22  +  6x3^  -  14  xz  -\-  IS  xw  +  S  yz  -\- 12  yw  -  4:  zw 
to  the  form  (5). 
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3.  Prove  that  if  (z/j, ...  z/n)  is  any  point  at  which  a  given  quadratic  form  is 
not  zero,  a  linear  transformation  can  be  found  (and  that  in  an  infinite  number  of 
ways)  which  carries  this  point  into  the  point  (0,  •••  0,  1)  and  its  polar  into  kx^'^ 
and  show  that  this  linear  transformation  eliminates  from  the  quadratic  form  all 
terms  in  x^  except  the  term  in  a;^  which  then  has  a  coefficient  not  zero. 

4.  Prove  that  the  transformations  described  in  Exercise  3  are  the  only  ones 
which  have  the  effect  there  described. 

5.  Show  how  the  two  methods  of  reduction  explained  in  this  section  come  as 
special  cases  under  the  transformation  of  Exercise  3. 


46.  A  Normal  Form,  and  the   Equivalence   of   Quadratic   Forms. 

In  the  method  of  reduction  explained  in  the  last  section,  it  may 
happen  that,  after  we  have  taken  a  number  of  steps,  and  thus 
reduced  <^  to  the  form 


c^x\  + 


+  ^k^k  +  ^k  \^k+V    '"  ^n)f 


the  form  cj)^  is  identically  zero.  In  this  case  no  further  reduction 
would  be  necessary  and  the  form  (5)  of  the  last  section  to  which  </>  is 
reduced  would  have  the  peculiarity  that  Cf^^^ 
while  all  the  earlier  e's  are  different  from  zero, 
when  this  case  will  occur. 

For  this  purpose,  consider  the  matrix 


—  ^k+2  — 


=  ^n  =  0, 
It  is  easy  to  see  just 


€^     0    •••  0 
^2  -  0 


0 


0      0    ...  €. 


of  the  reduced  form  (5)  of  §  45.  It  is  clear  that  the  rank  of  this 
matrix  is  precisely  equal  to  the  number  of  e's  different  from  zero  ; 
and,  since  the  rank  of  this  reduced  form  is  the  same  as  that  of  the 
original  form,  we  have  the  result : 

Theorem  1.     A  necessary  and  sufficient  condition  that  it  he  possi- 
ble to  reduce  a  quadratic  form  by  means  of  a  non-singular  linear  traris- 
formation  to  the  form 
(1)  ^  c^a^f  4-  •••  +  c^x% 


where  none  of  the  cs  are  zero^  is  that  the  rank  of  the  quadratic  form  be  r. 
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This  form  (1)  involves  r  coefficients  c^,  •••  c^.  That  the  values  of 
these  coefficients,  apart  from  the  fact  that  none  of  them  are  zero,  are 
immaterial  will  be  seen  if  we  consider  the  effect  on  (1)  of  the  trans- 
formation r  rr- 

^1 


(2) 


=  -  <^^-x' 


r 


ni 


where  hy,  •••  h^  are  arbitrarily  given  constants  none  of  which,  how- 
ever, is  to  be  zero.  The  transformation  (2)  is  non-singular,  and 
reduces  (1)  to  the  form 

(3)  \x^^+  •••  +A^^4^. 

Thus  we  have  proved 

Theorem  2.  A  quadratic  form  of  rank  r  can  he  reduced  hy  means 
of  a  non- singular  linear  transformation  to  the  form  (3),  where  the  values 
of  the  constants  ky,  •  •  •  k^.  may  he  assigned  at  pleasure  provided  none  of 
them  are  zero. 

If,  in  particular,  we  assign  to  all  the  A;'s  the  value  1,  we  get 

Theorem  3.  Every  quadratic  form  of  rank  r  can  he  reduced  to 
the  normal  form 

(4)  x\+  ...  +:r2 

hy  means  of  a  non-singular  linear  transformation. 

From  this  follows 

Theorem  4.  A  necessary  and  sufficient  condition  that  two  quad- 
ratic forms  he  equivalent  with  regard  to  non-singular  linear  trans- 
formations is  that  they  have  the  same  rank. 

That  this  is  a  necessary  condition  is  evident  from  the  fact  that  the 
rank  is  an  invariant.  That  it  is  a  sufficient  condition  follows  from 
the  fact  that,  if  the  ranks  are  the  same,  both  forms  can  be  reduced 
to  the  same  normal  form  (4). 
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The  normal  form  (4)  has  no  special  advantage,  except  its  sym- 
metry, over  any  other  form  which  could  be  obtained  from  (3)  by 
assigning  to  the  Fs  particular  numerical  values.  Thus,  for  instance, 
a  normal  form  which  might  be  used  in  place  of  (4)  is 

0^2     I       ,  ,  ,      I     /v'2         __  ncii 

.</j  -p  -p  -^^r-X        **'r* 

This  form  would  have  the  advantage,  in  geometrical  work,  of  giving 
rise  to  a  real  locus. 

Finally  we  note  that  the  transformations  used  in  this  section  are 
not  necessarily  real,  even  though  the  form  we  start  with  be  real. 

EXERCISE 

Apply  the  results  of  this  section  to  the  study  of  quadric  surfaces. 

47.  Reducibility.  A  quadratic  form  is  called  reducible  when  it  is 
identically  equal  to  the  product  of  two  linear  forms,  that  is,  when 

n 
(1)  S  aiPiXj  =  {h^X^  +  ^22^2  H 1-  K^n){<^\^l  +  ^2^2  +  '  * '  +  ^n^n)' 

Let  us  seek  a  necessary  and  sufficient  condition  that  this  be  the  case. 
We  begin  by  supposing  the  identity  (1)  to  hold,  and  we  consider  in 
succession  the  case  in  which  the  two  factors  in  the  right-hand  mem- 
ber of  (1)  are  linearly  independent,  and  that  in  which  they  are  pro- 
portional. In  the  first  case  the  5's  are  not  all  proportional  to  the 
corresponding  6»'s,  and  by  a  mere  change  of  notation  we  may  insure 
5^,  \  not  being  proportional  to  e^,  c^.  This  being  done,  the  trans- 
formation 

'X[  =  h^X^  -f  ^2^2  -I ^  ^n^n 

•^2  ^^  ^1*^1  "^  ^2*^2  "i"  ' "  "I"  ^n^n 
X^=  X^  . 


K^n  —  ^n 


is  non-singular  and  carries  our  quadratic  form  over  into  the  form 


The  matrix  of  this  form  is  readily  seen  to  be  of  rank  2,  hence  the 
original  form  was  of  rank  2. 
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Turning  now  to  the  case  in  which  the  two  factors  in  (1)  are 
proportional  to  each  other,  we  see  that  (1)  may  be  written 

n 

S  a^XiXj  =  C{h^x-^  4-  •  •  •  +  ^n^nf  where  (7^  0. 

Unless  all  the  J's  are  zero  (in  which  case  the  rank  of  the  quadratic 
form  is  zero)  we  may  without  loss  of  generality  suppose  ^i=?£=0,  in 
which  case  the  linear  transformation 

x[  =  h^x^-\ \-hj,Xn 

^2  —  ^2 


^n  —  ^n 


will  be  non-singular  and  will  reduce  the  quadratic  form  to 

Cx'\ 
which  is  of  rank  1.  ^' 

Thus  we  have  shown  that  if  a  quadratic  form  is  reducible,  its 
rank  is  0,  1,  or  2.  We  wish  now,  conversely,  to  prove  that  every 
quadratic  form  whose  rank  has  one  of  these  values  is  reducible. 

A  quadratic  form  of  rank  zero  is  obviously  reducible. 

A  form  of  rank  1  can  be  reduced  by  a  non-singular  linear  trans- 
formation to  the  form  x'^^  that  is, 

n  2 

If  here  we  substitute  for  x[  its  value  in  terms  of  the  rr's,  it  is  clear 
that  the  form  is  reducible. 

A  form  of  rank  2  can  be  reduced  to  the  form  x[  +  x'^,  that  is, 

2  aijXiXj  =  x[  -\- x'^  =  (x[ -{-  V  —  1  xi^)(x[  —  V  —  1  x'2). 

Here  again,  replacing  x[  and  xl^  by  their  values  in  terms  of  the  a;'s, 
the  reducibility  of  the  form  follows.     Hence, 

Theorem.  A  necessary  and  sufficient  condition  that  a  quadratic 
form  be  reducible  is  that  its  ra-nk  be  not  greater  than  2. 

48.  Integral  Rational  Invariants  of  a  Quadratic  Form.  We  have 
seen  that  the  discriminant  a  of  a  quadratic  form  is  an  invariant  of 
weight  2.  Any  integral  power  of  a,  or  more  generally,  any  constant 
multiple  of  such  a  power,  will  therefore  also  be  an  invariant.  We 
will  now  prove  conversely  the 

Theorem.  Every  integral  rational  invariant  of  a  quadratic  form 
is  a  constant  multiple  of  some  power  of  the  discriminant. 
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Let  us  begin  by  assuming  that  the  quadratic  form 

n 

(1)  l^aijXiXj 

is  non-singular,  and  let  c  be  the  determinant  of  a  linear  transforma- 
tion which  carries  it  over  into  the  normal  form 

(2)  42  +  ^/2+...+^r2. 

Let  licLii-,  • '  •  c^nn)  ^^  ^^7  integral  rational  invariant  of  (1)  of  weight  /a, 
and  denote  by  k  the  value  of  this  invariant  when  formed  from  (2). 
It  is  clear  that  A?  is  a  constant,  that  is,  independent  of  the  coefficients 
a^of(l).     Then  Jc=e^I. 

Moreover,  the  discriminant  a  being  of  weight  2,  and  having  for  (2) 
the  value  1,  we  have  ^^  _  ^^ 

Raising  the  last  two  equations  to  the  powers  2  and  fi  respectively, 
we  get  ;^2  =  ^M/2^        1=^2^  a'*. 

From  which  follows 

(3)  I^  =  Ba^. 

This  formula  has  been  established  so  far  for  all  values  of  the 
coefficients  a^-  for  which  a  ^  0.  That  it  is  really  an  identity  in  the 
a,/s  is  seen  at  once  by  a  reference  to  Theorem  5,  §  2.  The  poly- 
nomial on  the  right-hand  side  of  (3)  is  of  degree  fi  in  a^^ ;  *  hence 
we  see  that  /x  must  be  an  even  number,  since  P  is  of  even  degree  in 
^jj.  Letting  /x  =  2  z^,  we  infer  from  (3)  (cf .  Exercise  1,  §  2)  that  one 
or  the  other  of  the  identities 

/  =  A'a*',      ^  /  =  —  ka"" 

must  hold,  and  either  of  these  identities  establishes  our  theorem. 

A  comparison  of  the  result  of  this  section  with  Theorem  4,  §  46 
will  bring  out  clearly  the  essential  difference  between  the  two  con- 
ceptions of  a  complete  system  of  invariants  mentioned  in  §  29.  It  will 
be  seen  that  the  rank  of  a  quadratic  form  is  in  itself  a  complete  sys- 
tem of  invariants  for  this  form  in  the  sense  of  Definition  2,  §  29  ; 
while  the  discriminant  of  the  form  is  in  itself  a  complete  system  in 
the  sense  of  the  footnote  appended  to  this  definition. 

*  We  assume  here  that  k^O,  as  otherwise  the  truth  of  the  theorem  would  be 
obvious. 
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49.  A  Second  Method  of  Reducing  a  Quadratic  Form  to  a  Sum  of 
Squares.  By  the  side  of  Lagrange's  method  of  reducing  a  quadratic 
form  to  a  sum  of  squares,  there  are  many  other  methods  of  accom- 
plishing the  same  result,  one  of  the  most  useful  of  which  we  pro- 
ceed to  explain.  It  depends  on  the  following  three  theorems.  The 
proof  of  the  first  of  these  theorems  is  due  to  Kronecker  and  estab- 
lishes, in  a  remarkably  simple  manner,  the  fact  that  any  quadratic 
form  of  rank  r  can  be  written  in  terms  of  r  variables  only,  a  fact 
which  has  already  been  proved  by  another  method  in  Theorem  1,  §  46. 

Theorem  1.    If  the  rank  of  the  quadratic  form 

n 

(1)  <i>{x^,  '  ■  •  x„)  =  ^a^-XiXj 

1 

is  r  >  0,  and  if  the  variables  x^,  •  •  •  x^  are  so  numbered  that  the  r-rowed 

determinant  in  the  upper  left-hand  corner  of  its  matrix  is  not  zero,* 

new  variables  x[,  ■■•  x^  can  be  introduced  by  means  of  a  non-singular 

linear  transformation  such  that 

..      x[  =  Xi  {i=zr-\-l,  "-n), 

and  such  that  (1)  reduces  to  the  form 

r 

This,  it  will  be  noticed,  is  a  quadratic  form  in  r  variables  in  which 
the  coefficients,  so  far  as  they  go,  are  the  same  as  in  the  given  form  (1). 

In  order  to  prove  this  theorem,  we  begin  by  finding  a  vertex 
(^r  '"  ^n)  of  ^^^6  form  (1)  by  means  of  Equations  (3),  §  44.  Since  the 
r-rowed  determinant  which  stands  in  the  upper  left-hand  corner  of 
the  matrix  of  these  equations  is^not  zero,  the  values  of  c^+j,  •••<?„  may 
be  chosen  at  pleasure,  and  the  other  c's  are  then  completely  deter- 
mined.    If  we  let  Cj,^^  =  (?^^2  =  •  •  •  =  ^ra-i  =0,  Cn  =  1,  we  get  a  vertex 

(tfp  •••c^,  0,  •••  0,  1). 

Using  this  vertex  in  the  identity  (5),  §  44,  we  have 

^ {x^  -h  \(?p  '••  x^-\-  \c^,  x^+^,  •  •  •  a7„_i,  Xn-{-X)=: <f){x^,  •  •  •  x„). 

If  we  let  \  =  —  a;„,  this  identity  reduces  to 

*  That  such  an  arrangement  is  possible  is  evident  from  Theorem  3,  §  20. 
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Accordingly,  if  we  perform  the  non-singular  linear  transf ormat  ion  * 
{x'i  =  Xi-CiX^  (*  =  1,  •••/•), 

[3^1  =  ^i  (f=r  +  l,  ...n), 

the  quadratic  form  (1)  reduces  to 

n-l 

<f>  (^1^  •  •  •  ^'n-i^  0)  =  "Eaijx'ix'j, 

This,  being  a  quadratic  form  in  n  —  1  variables  of  rank  r  and  so 
arranged  that  the  r-rowed  determinant  which  stands  in  the  upper 
left-hand  corner  of  its  matrix  is  not  zero,  can  be  reduced,  by  the 
method  just  explained,  to  the  form 

re— 2 


2.ClijX^  Xj , 


where  the  linear  transformation  used  is  non-singular  and  such  that 

di=^x[  (^  =  r-f  1,  •••  n  — 1). 

By  adding  the  formula       x^l  =  x^t 

we  may  regard  this  as  a  non-singular  linear  transformation  in  the  n 
variables.  This  transformation  may  then  be  combined  with  the  one 
previously  used,  thus  giving  a  non-singular  transformation  in  wliich 

x'l  =  Xi,  (^  =  r-fl,  .-.Ti), 

and  such  that  it  reduces  (1)  to  the  form 

«-2 
^(a/j^jXa  Xj  • 

Proceeding  in  this  way  step  by  step,  our  theorem  is  at  last 
proved. 

In  the  next  two  theorems  we  denote  by  Aij  in  the  usual  way  the 
cof actor  of  «5^y  in  the  discriminant  a  of  the  quadratic  form  (1). 

Theorem  2.  i/'^^^^O,  new  variables  x\^  •••  xl  can  he  introduced 
hy  a  non-singular  transformation  in  such  a  way  that 


and  that  (1)  takes  the  form 


Zia^jX^Xj  -\-    .    x„. 


^nn 


*  This  transformation  should  be  compared  with  Exercise  2,  §  41. 
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V  _JL    2 


Its  discriminant  is 


a 


11 


^i,«-i 


a 


m 


n-h  1 

nl 


^n- 


a. 


»-l,  n—\       ^«-l,  n 
^n,  n-l  ^nn 


a 


Hence  by  means  of  a  non-singular  transformation  of  the  kind  used 
in  the  last  theorem,  an  essential  point  being  that  xl^  =  x^-,  we  get 


a 


n-l 


^  tfvV  X^  Xr!    ~~~     ~~. X~,     ^   Cv-i/i  Xo  X, 


r  ^r 


"ij-^i-^j 


A 


ij'^%'^3') 


or 


n  n-\  ^ 

Zd  d^j  X^  Xj  =  2j  d^j  X^  Xj  -\      -       X^  * 
^  ^1  -^«n 


Theorem  3.   If 

Ann  "^  ^n-l,«-l  ==  ^"i  -^n,»-l  ^  ^t 

new  variables  x\^  •  •  •  xl  can  he  introduced  hy  a  non-singular  transforma- 
tion in  such  a  way  that 


^n-l  —  '^n—V> 


and  that  (1)  takes  the  form 


n-2 


Zj  a^jX^Xj  -\- 


2a 


tj'^t'^j 


A 


^n —  -^w? 


n,  n-\ 


Let  us  denote  by  B  the  determinant  obtained  by  striking  out  the 
last  two  rows  and  columns  of  a.      Then  (cf.  Corollary  3,  §  11)  we 

have 

A  A 


(2) 


aB=: 


A  A 


=  -^l,„-i^O. 


Consider,  now,  the  quadratic  form 


(3) 


«  2a 

2^a^jXiXj  .  ^n^n-V 

^  -^«,«-l 
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Its  discriminant  is 


(4) 


a 


11 


a 


hn-\ 


a 


m 


^n-l,  1  '"  ^n-l,  n-l 


a 


n\ 


^n,n-l 


a 


-^n,  n-l 


^n-l,n 


a. 


a 


A 


=  a  — 


A  ^       —  A  ^       —  7? 

-^^n,  n-l  4  -^^n,  n—l  -*-* 


-*^w,  n—l 


A 


n,  n-l 


a 


A 

-^^n,  n—l 


which  has  the  value  zero,  as  we  see  by  making  use  of  (2).  Not  only- 
does  the  determinant  (4)  vanish,  but  its  principal  minors  obtained 
by  striking  out  its  last  row  and  column  and  its  next  to  the  last  row 
and  column  are  zero,  being  A^n  and  ^^_j^„_j  respectively.  The  minor 
obtained  by  striking  out  the  last  two  rows  and  columns  from  (4)  is 
B,  and,  by  (2),  this  is  not  zero.  Thus  we  see  (cf..  Theorem  1,  §  20) 
that  the  determinant  (4)  is  of  rank  n  —  2.  Hence,  by  Theorem  1, 
we  can  reduce  (3)  by  a  non-singular  linear  transformation  in  which 

^n-l  =  ^n-V    ^n  =  ^n    ^O  the  fomi 

la 

j  ^n^n-l 


^a. 


IjXj^Juj 


'•«,  n-l 


n-2 

1 


''IJ'^V^J' 


Hence 


n  w— 2  9  /» 

■'■  ^  -^n,  n-l 


Corollary.      Under  the  conditions  of  Theorem  3,  the  quadratic 
form  (1)  can  he  reduced  to  the  form 

2a 


n-2 

^  a^A  U/^  *€/,•  ~f~         . 

I   ''  '  ^     A 


y-f^n-l        -^n  ) 


n,  n—l 


by  a  non-singular  linear  transformation. 

To   see  this  we  have  merely  first  to  perform  the  reduction  of 

Theorem  3,  and  then  to  follow  this  by  the  additional  non-singular 

transformation  ...  .       .    ^ 

(^  =  l,  2,  ...ri-2), 


f  ^; 

= 

=^1 

^n-l 

= 

<- 

1 

— 

-      ^n 

=z 

<- 

•1 

+ 

x" 
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Having  thus  established  these  three  theorems,  the  method  of 
reducing  a  quadratic  form  completely  is  obvious.  If  the  form  (1)  is 
singular,  we  begin  by  reducing  it  by  Theorem  1  to 

r 

where  r  is  the  rank  of  the  form.  Unless  all  the  principal  (r—  1)- 
rowed  minors  of  the  discriminant  of  this  form  are  zero,  the  order 
of  the  variables  rr^,  •  •  •  x^.  can  be  so  arranged  that  the  reduction  of 
Theorem  2  is  possible,  a  reduction  which  may  be  regarded  as  a  non- 
singular  linear  transformation  of  all  n  variables.  If  all  the  princi- 
pal (r  —  1) -rowed  minors  are  zero,  there  will  be  at  least  one  of  the 
cofactors  A^j  which  is  not  zero,  and,  by  a  suitable  rearrangement  of 
the  order  of  the  variables,  this  may  be  taken  as  A^^^-v  "^^^  reduc- 
tion of  Theorem  3,  Corollary,  will  then  be  possible.  Proceeding  in 
this  way,  we  finally  reach  the  result,  precisely  as  in  Theorem  1,  §  46, 
that  a  quadratic  form  of  rank  r  can  always  be  reduced  by  a  non- 
singular  linear  transformation  to  the  form 

/>     ^2      I       ...       I      p    nnZ 
O-i  •//-«    ^^     •  •  •     ^^    V^Uy^.t 

It  may  be  noticed  that  the  arrangement  of  the  transformation  of 
this  section  is  in  a  certain  sense  precisely  the  reverse  of  that  of  §45, 
inasmuch  as  we  here  leave  at  each  step  the  coefficients  of  the  unre- 
duced part  of  the  form  unchanged,  but  change  the  variables  which 
enter  into  this  part ;  while  in  §45  we  change  the  coefficients  of  the 
unreduced  part,  but  leave  the  variables  in  it  unchanged. 
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50.  The  Law  of  Inertia.  We  come  now  to  the  study  of  real 
quadratic  forms  and  the  effect  produced  on  them  by  real  linear 
transformations. 

We  notice,  here,  to  begin  with,  that  the  only  operations  involved 
in  the  last  chapter  are  rational  operations  {i.e.  addition,  subtraction, 
multiplication,  and  division)  with  the  single  exception  of  the  radicals 
which  come  into  formula  (2),  §  46.  In  particular  the  reduction  of 
§  45  (or  the  alternative  reduction  of  §  49)  involves  only  rational  oper- 
ations. Consequently,  since  rational  operations  performed  on  real 
quantities  give  real  results,  we  have 

Theorem  1.  A  real  quadratic  form  of  rank  r  can  he  reduced  hy 
means  of  a  real  non-singular  linear  transformation  to  the  form 

(  -L  )  11      i"     2    2      ■"    * '  *      •"  ^r^r 

where  c^,  •••  c^  are  real  constayits  none  of  which  are  zero. 

As  we  saw  in  the  last  chapter,  this  reduction  can  be  performed  in  a 
variety  of  ways,  and  the  values  of  the  coefficients  c^,  •  •  •  c^  in  the  reduced 
form  will  be  different  for  the  different  reductions.  The  signs  of  these 
coefficients,  apart  from  the  order  in  which  they  occur,  will  not  depend 
on  the  particular  reduction  used,  as  is  stated  in  the  following  im- 
portant theorem  discovered  independently  by  Jacobi  and  Sylvester 
and  called  by  the  latter  the  Law  of  Inertia  of  Quadratic  Forms : 

Theorem  2.    If  a  real  quadratic  form  of  rank  r  is  reduced  hy  two 
real  non-singular  linear  transformations  to  the  forms  (1)  and 
(2)  h^x^l'^  -t-  \xl^  +  .  •  •  +  h^^l^ 

respectively^  then  the  numher  of  positive  es  in  (1)  is  equal  to  the  numher 
of  positive  k^s  in  (2). 

In  order  to  prove  this,  let  us  suppose  that  the  x'^s  and  x"^s  have 
been  so  numbered  that  the  first  /i  of  the  c's  and  the  first  v  of  the  k's 
are  positive  while  all  the  remaining  <?'s  and  k's  are  negative.     Our 

144 


REAL  QUADRATIC   FORMS  145 

theorem  will  be  established  if  we  can  show  that  /jl  =  v.  If  this  is 
not  the  case,  one  of  the  two  integers  /i  and  v  must  be  the  greater, 
and  it  is  merely  a  matter  of  notation  to  assume  that  fJL>  v.  We  will 
prove  that  this  assumption  leads  to  a  contradiction. 

If  we  regard  the  x'^s  and  x"^s  simply  as  abbreviations  for  certain 
linear  forms  in  the  x's,  (1)  and  (2)  are  both  of  them  identically  equal 
to  the  original  quadratic  form,  and  hence  to  each  other.  This  iden- 
tity may  be  written 


=k^x^{^-{-  ..•  +hx'l^-\h^^\x^ll^-' \k 


112 


T 

Let  us  now  consider  the  system  of  homogeneous  linear  equations 
in  (rrp  "-Xn), 
(4)  x'l  =  0,  .-  x^:  =  0,  xl^,  =  0,  ...  x^,  =  0. 

We  have  here  v  -{-n  —  /jL<n  equations.  Hence,  by  Theorem  3, 
Corollary  1,  §17,  we  can  find  a  solution  of  these  equations  in  which 
all  the  unknowns  are  not  zero.  Let  (y^,  ...  ^^)  be  such  a  solution  and 
denote  by  ^/J,  ^'l  the  values  of  x^^  x'l  when  the  constants  ^/j, .  •  •  y^  are 
substituted  in  them  for  the  variables  x^^  •••  x^^.  Substituting  the  y's 
for  the  ic's  in  (3)  gives 


^i/i^H-  •••  4-c^^I?=  -\K+i\y'lli-  •••  -\K\y 


f'2 
r   • 


The  expression  on  the  left  cannot  be  negative,  and  that  on  the  right 
cannot  be  positive,  hence  they  must  both  be  zero;  and  this  is  pos- 
sible only  if  r  r      r. 

But  by  (4)  we  also  have     ^jj^+j  =  •••  =  i/J^  =  0. 

That  is,  (j/i,  "-yn)  is  a  solution,  not  composed  exclusively  of  zeros,  of 
the  system  of  n  homogeneous  linear  equations  in  n  unknowns, 

a:;  =  0,  4  =  0,  ...:r^=0. 

The  determinant  of  these  equations  must  therefore  be  zero,  that  is, 
the  linear  transformation  which  carries  over  the  x's  into  the  a:''s  must 
be  a  singular  transformation.  We  are  here  led  to  a  contradiction, 
and  our  theorem  is  proved. 
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We  can  thus  associate  with  every  real  quadratic  form  two  in- 
tegers P  and  i\r,  namely,  the  number  of  positive  and  negative  coeffi- 
cients respectively  which  we  get  when  we  reduce  the  form  by  any 
real  non-singular  linear  transformation  to  the  form  (1).  These  two 
numbers  are  evidently  arithmetical  invariants  of  the  quadratic  form 
with  regard  to  real  non-singular  linear  transformations,  since  two 
real  quadratic  forms  which  can  be  transformed  into  one  another  by 
means  of  such  a  transformation  can  obviously  be  reduced  to  the  same 
expression  of  form  (1).* 

The  two  arithmetical  invariants  P  and  iV  which  we  have  thus 
arrived  at,  and  the  arithmetical  invariant  r  which  we  had  before,  are 
not  independent  since  we  have  the  relation 

(5)  P-\-N=r, 

One  of  the  invariants  P  and  N  is  therefore  superfluous  and  either 
might  be  dispensed  with.  It  is  found  more  convenient,  however, 
to  use  neither  P  nor  iV,  but  their  difference, 

(6)  s  =  P-N, 

which  is  called  the  signature  of  the  quadratic  form. 

Definition.  By  the  signature  of  a  real  quadratic  form  is  under- 
stood the  difference  between  the  number  of  positive  and  the  number  of 
negative  coefficients  which  we  obtain  when  we  reduce  the  form  by  any 
real  non- singular  linear  transformation  to  the  form  (1). 

Since  the  integers  P  and  iVused  above  were  arithmetical  invari- 
ants, their  difference  s  will  also  be  an  arithmetical  invariant.  It 
should  be  noticed,  however,  that  s  is  not  necessarily  a  positive  in- 
teger.    We  have  thus  proved 

Theorem  3.  The  signature  of  a  quadratic  form  is  an  arithmetical 
invariant  with  regard  to  real  non-singular  linear  transformations. 

EXERCISES 

1.  Prove  that  the  rank  r  and  the  signature  5  of  a  quadratic  form  are  either 

both  even  or  both  odd  ;  and  that  ^    ^ 

'  — r<s<r. 

2.  Prove  that  any  two  integers  r  and  s  (r  positive  or  zero)  satisfying  the  con- 
ditions of  Exercise  1  may  be  the  rank  and  signature  respectively  of  a  quadratic 
form. 

*  P  is  sometimes  called  the  index  of  inertia  of  the  quadratic  form. 
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3.  Prove  that  a  necessary  and  sufficient  condition  that  a  real  quadratic  form 
of  rank  r  and  signature  s  be  factorable  into  two  real  linear  factors  is  that 

either  r<2; 

or  r  =  2,  5  =  0. 

4.  A  quadratic  form  of  rank  r  shall  be  said  to  be  regularly  arranged  (cf.  §  20, 
Theorem  4)  if  the  a;'s  are  so  numbered  that  no  two  consecutive  J.'s  are  zero  in  the  set 


Aq  —  1,  -4j  —  Ojj,  A^  — 


11  '"    1^ 

^11^12  J     _     •        • 

^21  ^22  *         * 

and  that  Ar  ^  0.     Prove  that  if  the  form  is  real  and  any  one  of  these  A's  is  zero, 
the  two  adjacent  A's  have  opposite  signs. 

[Suggestion.   In  this  exercise  and  the  following  ones,  the  work  of  §  49  should  be 

consulted.] 

5.  Prove  that  the  signature  of  a  regularly  arranged  real  quadratic  form  is 
equal  to  the  number  of  permanences  minus  the  number  of  variations  of  sign  in  the 
sequence  of  the  -4's,  if  the  ^'s  which  are  zero  are  counted  as  positive  or  as  nega- 
tive at  pleasure. 

6.  Defining  the  expression  sgn  x  (read  signum  x)  by  the  equations 

sgnx  =  4-  1  a:>0, 
sgn  a:  =  0  a;  =  0, 
sgnx=  —  1         a;<0, 

show  that  the  signature  of  a  regularly  arranged  real  quadratic  form  of  rank  r  is 

sgn  (^o^i)  +  sgn (^1^2)  +  •••  +  sgn  (Ar-iAr). 

51.  Classification  of  Real  Quadratic  Forms.  We  saw  in  the  last 
section  that  a  real  quadratic  form  has  two  invariants  with  regard  to 
real  non-singular  linear  transformations,  —  its  rank  and  its  signa- 
ture. The  main  result  to  be  established  in  the  present  section 
(Theorem  2)  is  that  these  two  invariants  form  a  complete  system. 

If  in  §  46  the  es  and  Fs  are  real,  the  transformation  (2)  will  be 
real  when,  but  only  when,  each  c  has  the  same  sign  as  the  corre- 
sponding k.  All  that  we  can  infer  from  the  reasoning  of  that  section 
now  is,  therefore,  that  if  a  real  quadratic  form  of  rank  r  can  be 
reduced   by  a  real  non-singular  linear  transformation  to  the  form 

1  "^l    '      ' '  *      '      r  *^ri 

it  can  also  be  reduced  by  a  real  non-singular  linear  transformation  to 
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where  the  Fs  are  arbitrarily  given  real  constants,  not  zero,  subject 
to  the  condition  that  each  k  has  the  same  sign  as  the  corresponding  c. 
Using  the  letters  P  and  iV  for  the  number  of  positive  and  negative  es 
respectively,  the  transformation  can  be  so  arranged  that  the  first 
P  cs  are  positive,  the  last  iV  negative.  Accordingly  the  first  P  k's 
can  be  taken  as  +  1,  the  last  iV  as  —  1.  From  equations  (5)  and  (6) 
of  §  50,  we  see  that  P  and  iVmay  be  expressed  in  terms  of  the  rank 
and  signature  of  the  form  by  the  formulae 

(1)  ^  =  ^.         ^=^- 


Thus  we  have  the  theorem  : 

Theorem  1.  A  real  quadratic  form  of  rank  r  and  signature  s  can  he 
reduced  by  a  real  non-singular  linear  transformation  to  the  normal  form 

(2)  x\+ \-x%  —  x\+^— xl 

where  P  is  given  by  (1). 

We  are  now  able  to  prove  the  fundamental  theorem  : 

Theorem  2.  A  necessary  and  sufficient  condition  that  two  real 
quadratic  forms  be  equivalent  with  regard  to  real  non-singular  linear 
transformations  is  that  they  have  the  same  rank  and  the  same  signature. 

That  this  is  a  necessary  condition  is  evident  from  the  invariance 
of  rank  and  signature.  That  it  is  sufficient  follows  from  the  fact 
that  if  the  two  forms  have  the  same  rank  and  signature,  they  can 
both  be  reduced  to  the  same  normal  form  (2). 

Definition.  All  real  quadratic  forms,  equivalent  with  regard  to 
real  non-singular  linear  transformations  to  a  given  form,  and  therefore  to 
each  other,  are  said  to  form  a  class.* 

Thus,  for  instance,  since  every  real  non-singular  quadratic  form 
in  four  variables  can  be  reduced  to  one  or  the  other  of  the  five 
normal  forms, 


(3) 


/y»2  _J_  o^2     I     /y>2  _4_  0^2 

.€/j  -J-  .t/2     1     •*'3  n^  **'4i 

/y»2       I       /v'2       I       /j^2    ^_  /ya 

•^2     '        2     '        3  4' 

o»2      I       0^2    nr"    ^_  /y'2 

•*'l     1     •*^2            3  4' 

n^a    ^^    /yZ    __    /y4    _^  /ya 

•*"!          •*'2             3  4' 

_^    /y«2    /v>2    __    /Y'2    ^_  /y'2 

•*'l          •*'2          "^3  *^4' 

*  This  term  may  be  used  in  a  similar  manner  whenever  the  conception  of  equiva- 
lence is  involved. 
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we  see  that  all  such  forms  belong  to  one  or  the  other  of  five  classes 
characterized  by  the  values 

s  =  4,  2,  0,  -  2,  -  4,         r  =  4. 

If,  however,  as  is  the  case  in  many  problems  in  geometry,  we  are 
concerned  not  with  quadratic  forms,  but  with  the  equations  obtained 
by  equating  these  forms  to  zero,  the  number  of  classes  to  be  distin- 
guished will  be  reduced  by  about  one  half,  since  two  equations  are 
the  same  if  their  first  members  differ  merely  in  sign. 

Thus  there  are  only  three  classes  of  non-singular  quadric  surfaces 
with  real  equations,  whose  normal  forms  are  obtained  by  equating 
the  first  three  of  the  forms  (3)  to  zero.  These  equations  written  in 
non-homogeneous  coordinates  are 

X2  4-    F2  +  Z2  =  -  1, 

X^-\-  Y^-  Z^  =  1. 

The  first  of  these  represents^  an  imaginary  sphere,  the  second  a  real 
sphere,  and  the  third  an  unparted  hyperboloid  generated  by  the  revolu- 
tion of  a  rectangular  hyperbola  about  its  conjugate  axis.  It  may  readily 
be  proved  that  this  last  surface  may  also  be  generated  by  the  revolution 
of  either  of  the  lines  V  =  1    X  =  +  Z 

about  the  axis  of  Z.     We  may  therefore  say  : 

Theorem  3.  There  are  three^  and  only  three^  classes  of  non-singular 
quadric  surfaces  with  real  equations.  In  the  first  the  surfaces  are  imag- 
inary ;  in  the  second  real^  hut  their  rulings  are  imaginary  ;  in  the  third 
they  are  real^  and  the  rulings  through  their  real  points  are  real.^ 

This  classification  is  complete  from  the  point  of  view  we  have 
adopted  of  regarding  quadric  surfaces  as  equivalent  if  one  can  be 
transformed  into  the  other  by  a  real  non-singular  coUineation.  The 
more  familiar  classification  does  not  adopt  this  projective  view,  but 
distinguishes  in  our  second  class  between  ellipsoids,  biparted  hyper- 
boloids,  and  elliptic  paraboloids  ;  and  in  the  third  class  between  un- 
parted hyperboloids  and  hyperbolic  paraboloids. 

*  If,  as  here,  we  consider  not  real  quadratic  forms,  but  real  homogeneous  quadratic 
equations  we  must  use,  not  s,  but  \s\  as  an  invariant.  In  place  of  \s\  we  may  use  what  is 
known  as  the  characteristic  of  the  quadratic  form,  that  is  the  smaller  of  the  two  in- 
tegers P,  N.    This  characteristic  is  simply  \(r  —  \s\). 
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EXERCISES 

1.  Prove  that  there  are  |(n  +  1)  (n  +  2)  classes  of  real  quadratic  forms  in  n 
variables. 

2.  Give  a  complete  classification  of  singular  quadric  surfaces  with  real  equa- 
tions from  the  point  of  view  of  the  present  section. 

52.   Definite  and  Indefinite  Forms. 

Definition.  By  an  indefinite  quadratic  form  is  understood  a  real 
quadratic  form  such  that^  when  it  is  reduced  to  the  normal  form  (2), 
§  51,  hy  a  real  non-singular  linear  transformation,  both  positive  and  neg- 
ative signs  occur.  All  other  real  quadratic  forms  are  called  definite  ;  * 
and  we  distinguish  between  positive  and  negative  definite  forms  accord- 
ing as  the  terms  in  the  normal  form  are  all  positive  or  all  negative. 

In  other  words,  a  real  quadratic  form  of  rank  r  and  signature  s 
is  definite  if  s  =  ±  r,  otherwise  it  is  indefinite.! 

The  names  definite  and  indefinite  have  been  given  on  account  of 
the  following  fundamental  property: 

Theorem  1.  An  indefinite  quadratic  form  is  positive  for  some  real 
values  of  the  variables^  negative  for  others.  A  positive  definite  form  is 
positive  or  zero  for  all  real  values  of  the  variables ;  a  negative  definite 
form^  negative  or  zero. 

The  part  of  this  theorem  which  relates  to  definite  forms  follows 
directly  from  the  definition.  To  prove  the  part  concerning  indefi- 
nite forms,  suppose  the  form  reduced  by  a  real  non-singular  linear 
transformation  to  the  normal  form 


(1)  42  +  ...+^^_^/^^^ ^^ 


Regarding  the  x^''s  as  abbreviations  for  certain  real  linear  forms  in 
the  x^s^  let  us  consider  the  system  oi  n  —  P  homogeneous  linear  equa- 
tions 
(2)  ;Bi,^l  =  0,  :ei,  +  2  =  0,-4=0. 

Since  these  equations  are  real,  and  their  number  is  less  than  the 
number   of    unknowns,   they  have    a    real    solution   not    consisting 

*  Some  writers  reserve  the  name  definite  for  non-singular  forms,  and  call  the 
singular  definite  forms  semidefinite. 

t  Otherwise  stated,  the  condition  for  a  definite  form  is  that  the  characteristic  he 
zero.     Cf.  the  footnote  to  Theorem  3,  §  51. 
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excltisively  of  zeros.  Let  (?/p  •••  y„)  be  such  a  solution.  This 
solution  cannot  satisfy  all  the  equations 

(3)  x[  =  0,'"x'p  =  0, 

for  equations  (2)  and  (3)  together  form  a  system  of  n  homogeneous 
linear  equations  in  n  unknowns  whose  determinant  is  not  zero,  since 
it  is  the  determinant  of  the  linear  transformation  which  reduces  the 
given  quadratic  form  to  the  normal  form  (1).  Accordingly,  if  we 
substitute  (^^  •  •  •  ?/„)  for  the  variables  (a;^,  •  •  •  x^)  in  the  given  quad- 
ratic form,  this  form  will  have  a  positive  value,  as  we  see  from  the 
reduced  form  (1). 

Similarly,  by  choosing  for  the  x's  a  real  solution  of  the  equations 

2^2  =  U,  •  •  •  Xp  =  U,  ^r+i  ^^  ^1  *  ■  ■  ^»  =  '-'? 

which  does  not  consist  exclusively  of  zeros,  we  see  that  the  quad- 
ratic form  takes  on  a  negative  value. 

We  pass  now  to  some  theorems  which  will  be  better  appreciated 
by  the  reader  if  he  considers  their  geometrical  meaning  in  the 
case  n  =  4. 

Theorem  2.  If  an  indefinite  quadratic  form  is  positive  at  the  real 
point  (?/j,  •••  3/^)  and  negative  at  the  real  point  (2^,  •••  s^),  then  there 
are  two  real  points  linearly  dependent  on  these  two^  hut  linearly  inde- 
pendeyit  of  each  other^  at  which  the  quadratic  form  is  zero^  and  neither 
of  which  is  a  vertex  of  the  form. 

The  condition  that  the  quadratic  form 

n 
y±)  Z^ayXjXj 

vanish  at  the  point  («/j  +  X^^,  •••  y^  +  \z^  is 

n  n  n 

^%-^i^j  +  2  X^a^^iZj  +  X^l^a^jZiZj  =  0. 

This  quadratic  equation  in  X  has  two  real  distinct  roots,  since, 
from  our  hypothesis  that  (4)  is  positive  at  y  and  negative  at  z^  it 
follows  that 

Let  us  call  these  roots  \  and  X2.     Then  the  points 
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Vi  +  '^x^i    Vj  4-  ^1% 

1  Xj 

ViVi 

Vi  +  Xg^i    yj  +  \^Zj 

1X2 

Zi  Zj 

are  two  real  points  linearly  dependent  on  the  points  y  and  z  at  Which 
(4)  vanishes. 

Next  notice  that 


(6) 


Since  the  points  y  and  z  are  linearly  independent,  the  integers  z,/ 
can  be  so  chosen  that  the  last  determinant  on  the  right  of  (6)  is  not 
zero.  Then  the  determinant  on  the  left  of  (6)  is  not  zero;  and, 
consequently,  the  points  (5)  are  linearly  independent. 

In  order,  finally,  to  prove  that  neither  of  the  points  (5)  is  a 
vertex,  denote  them  for  brevity  by 

Letting  Xj  —  Xg  =  l//>t,  we  have 

Zi  =  tiYi  —  ixZi 
Therefore 
(7)  ^ai^ZiZj  =  fM^l^aijYiYj  —  2  fi^I^aijYiZj  +  fju^'laijZiZj. 


0'  =  1,  2,  ...71). 


Since  the  points  Y  and  Z^  have  been  so  determined  that  (4)  vanishes 
at  them,  the  first  and  last  terms  on  the  right  of  (7)  are  zero.  If 
either  Y  or  Z  were  a  vertex,  the  middle  term  would  also  be  zero  ; 
but  this  is  impossible  since  the  left-hand  member  of  (7)  is,  by 
hypothesis,  negative.     Thus  our  theorem  is  proved. 

For  the  sake  of  completeness  we  add  the  corollary,  whose  truth 
is  at  once  evident  : 

Corollary.  The  only  points  linearly  dependent  on  y  and  z  at 
which  the  quadratic  form  vanishes  are  points  linearly  dependent  on  one 
or  the  other  of  the  points  referred  to  in  the  theorem;  and  none  of  these 
are  vertices. 

We  come  now  to  a  theorem  of  fundamental  importance  in  the 
theory  of  quadratic  forms. 

Theorem  3.  A  necessary  and  sufficient  condition  that  a  real 
quadratic  form  he  definite  is  that  it  vanish  at  no  real  points  except  its 
vertices  and  the  point  (0,  0,  ••.  0). 
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Suppose,  first,  that  we  have  a  real  quadratic  form  which  vanishes 
at  no  real  points  except  its  vertices  and  the  point  (0,  0,  •••  0).  If  it 
were  indefinite,  we  could  (Theorem  1)  find  two  real  points  y,  z^  at 
one  of  which  it  is  positive,  at  the  other,  negative.  Hence  (Theorem 
2)  we  could  find  two  real  points  linearly  dependent  on  y  and  2,  at 
which  the  quadratic  form  vanishes.  Neither  of  these  will  be  the 
point  (0,  0,  •••  0),  since,  by  Theorem  2,  they  are  linearly  independ- 
ent. Moreover,  they  are  neither  of  them  vertices.  Thus  we  see 
that  the  form  must  be  definite,  and  the  sufficiency  of  the  condition 
is  established. 

It  remains  to  be  proved  that  a  definite  form  can  vanish  only  at 
its  vertices  and  at  the  point  (0,  0,  •••  0). 

Suppose  (4)  is  definite  and  that  (?/j,  •••  y^  is  any  real  point  at 
which  it  vanishes.     Then, 

n  n  n 

^aij{xi-\-  \yi){xj  +  X^y)  =  ^a^jXiXj  +  2  Xla^jXiyj. 

If  y  were  neither  a  vertex  nor  the  point  (0,  0,  •••  0),  ^a^jXiyj  would 

not  vanish  identically,  and  we  could  find  a  real  point  (z^^  •••  z^)  such 

that  n 

k  =  ^a^f^y^  ^  0, 


If  we  let  c  =  'la^-ZjZp 

we  have 

n 

(8)  ^%{zi  4-  '^!/i)(Zj  +  ^yj)  =  c  +  2Xk, 

For  a  given  real  value  of  \,  the  left-hand  side  of  this  equation 
is  simply  the  value  of  the  quadratic  form  (4)  at  a  certain  real  point. 
Accordingly,  for  different  values  of  X  it  will  not  change  sign,  while 
the  right-hand  side  of  (8)  has  opposite  signs  for  large  positive 
and  large  negative  values  of  X.  Thus  the  assumption  that  y  was 
neither  a  vertex  nor  the  point  (0,  0,  •••  0)  has  led  to  a  contradiction; 
and  our  theorem  is  proved. 

Corollary.     A   non-singular   definite   quadratic  form  vanishes^ 
for  real  values  of  the  variables,  only  when  its  variables  are  all  zero. 

As  a  simple  application  of  the  last  corollary  we  will  prove 

Theorem  4:.  In  a  non-singular  definite  form,  none  of  the  coeffi- 
cients of  the  square    terms  can  be  zero. 
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For  suppose  the  form  (4)  were  definite  and  non-singular;  and 
that  %  =  0.     Then  the  form  would  vanish  at  the  point 

x^=  •••  =Xi_^  =  ^i+2  =  •••  =a:^„  =  U,  a?^=  1 ; 
and  this  is  impossible,  since  this  is  not  the  point  (0,  0, ...  0^ 

EXERCISES 

1.  Definition.  By  an  orthogonal  transformation  *  is  understood  a  linear  trajis- 
formation  which  carries  over  the  variables  (ari,.--^n)  into  the  variables  (a;(,«-.a:„)  in  such 
a  way  that 

xl-\-xl+  ..•  +  xl  =  x[^  +  xfi  +  ...  +  x'^. 

Prove  that  every  orthogonal  transformation  is  non-singular,  and,  in  particular, 
that  its  determinant  must  have  the  value  +  1  or  —  1. 

2.  Prove  that  all  orthogonal  transformations  in  n  variables  form  a  group ;  and 
that  the  same  is  true  of  all  orthogonal  transformations  in  n  variables  of  deter- 
minant +  1. 

3.  Prove  that  a  necessary  and  sufficient  condition  that  a  linear  transforma- 
tion be  orthogonal  is  that  it  leave  the  "  distance  " 


V(2/i  -  ^i)2  +  {y2  -  z^y  +  -  +  (z/n  -  z^y 

between  every  pair  of  points  (z/i,  •••  ?/„),  (21,  .♦.  2;„)  invariant. 

4.  Prove  that  if  n  =  3,  and  if  x\^  x^,  x^  be  interpreted  as  non-homogeneous 
rectanglar  coordinates  in  space,  an  orthogonal  transformation  represents  either  a 
rigid  displacement  which  leaves  the  origin  fixed,  or  such  a  displacement  combined 
with  reflection  in  a  plane  through  the  origin. 

Show  that  the  first  of  these  cases  will  occur  when  the  determinant  of  the 
transformation  is  +  1,  the  second  when  this  determinant  is  —  1. 

5.  If  the  coefficients  of  a  linear  transformation  are  denoted  in  the  usual  way 
by  Cy-,  prove  that  a  necessary  and  sufficient  condition  that  the  transformation  be 
orthogonal  is  that  ^2.  ^  ^2.  ^  _  ^^2.  ^  ^  ^ •  ^  1^  2, ...  rz), 

.  fi  =  l,2, ...  n.       . 

C\iCy  +  CiiC<ij  +  •••  +  CniCnj  =  ^  1    .        ^    ^  *  ^J' 

\j  =1,2,  ...n 

Show  that  these  will  still  be  necessary  and  sufficient  conditions  for  an  orthogonal 
transformation  if  the  two  subscripts  of  every  c  be  interchanged. f 

*  The  matrix  of  such  a  transformation  is  called  an  orthogonal  matrix,  and  its  deter- 
minant an  orthogonal  determinant. 

t  We  have  here  \  n  {n  -{- V)  relations  between  the  n'^  coefficients  of  the  transforma- 
tion.    This  suggests  that  it  should  be  possible  to  repress  all  the  coefficients  in  terms  of 

7i2  _  n{n  +  l)  _  njn-V) 
2  2 

of  them,  or  if  we  prefer  in  terms  oi\n{n  —  l)  other  parameters.  For  Cayley's  dis- 
cussion of  this  question  cf .  Pascal's  book,  Die  Determinanten,  §  47.  Cayley's  formulae, 
however,  do  not  include  all  orthogonal  transformations  except  as  limiting  cases. 


CHAPTER  XII 

THE   SYSTEM   OF   A   QUADRATIC    FORM   AND   ONE   OR   MORE 

LINEAR    FORMS 

53.   Relations  of  Planes  and  Lines  to  a  Quadric  Surface.     If  th^ 
plane 

(1)  u-^x^  +  U2X2  +  u^x^  +  u^x^  =  0 

is  a  true  tangent  plane  to  the  quadric  surface 

4 

(2)  ^ciijXiXj  =  0, 

there  will  be  a  point  (y^,  ^^^  ^g,  y^)  (namely  the  point  of  contact) 

lying  in  (1)  and  such  that  its  polar  plane 

~        4 

(3)  ^%-^i^j  =  0 

coincides  with  (1).  From  elementary  analytic  geometry  we  know 
that  a  necessary  and  sufficient  condition  that  two  equations  of  the 
first  degree  represent  the  same  plane  is  that  their  coefficients  be  pro- 
portional. Accordingly,  from  the  coincidence  of  (1)  and  (3),  we 
deduce  the  equations 

^11^1  +  ^12^2  +  ^13^3  +  ^14^4  -  P^l  =  ^' 

^21^1  +  ^22^2  +  ^23^3  +  ^24^4  "  P^h  =  ^' 

^31^1  +  ^32^2  +  «33^3  +  ^34^4  "  P^3  =  ^' 

.  ^41^1  +  ^42^2  +  ^43^3  +  ^44^4  "  P'^4:  =  ^• 


(4) 


From  the  fact  that  the  point  2/  lies  on  (1),  we  infer  the  further 
relation 

(5)  'i^l^l +  ^2^2 +  ^3^3  +  ^42^4  =^- 

These  equations  (4)  and  (5)  have  been  deduced  on  the  suppo- 
sition that  (1)  is  a  true  tangent  plane  to  (2).  They  still  hold  if 
it  is  a  pseudo-tangent  plane  ;  for  then  the  quadric  must  be  a  cone, 
and  a  vertex  of  this  cone  must  lie  on  (1).  Taking  the  point  ^  as 
this  vertex,  equation  (5)  is  fulfilled.     Moreover,  since  now  the  first 
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member  of  (3)  is  identically  zero,  equations  (4)  will  also  be  fulfilled 
if  we  let  p  =  0.  Thus  we  have  shown  in  all  cases,  that  if  (1)  is  a 
tangent  plane  to  (2),  there  exist  five  constants,  y^,  t/^i  3/3,  y^,  /a,  of 
which  the  first  four  are  not  all  zero,  and  which  satisfy  equations  (4) 
and  (5).     Hence 


(6) 


a 


11 


a 


21 


a 


31 


a 


41 


^12 

^13 

^14 

^22 

^23 

^24 

^32 

^33 

^34 

^42 

«43 

«44 

U-, 


Uc 


Uc 


u. 


U-, 


Uc 


Ur 


0 


=  0. 


Conversely,  if  this  last  equation  is  fulfilled,  there  exist  five 
constants,  ?/j,  1/2,  ^^^  ^4^  P")  ^^^  ^^^  zero,  and  which  satisfy  equations 
(4)  and  (5).  We  can  go  a  step  farther  and  say  that  ^/i^  ^2'  ^3'  ^4 
cannot  all  be  zero,  as  otherwise,  from  equations  (4)  and  the  fact  that 
the  u's  are  not  all  zero,  p  would  also  be  zero.  Thus  we  see  that  if 
equation  (6)  is  fulfilled,  there  exists  a  point  (^/j,  ^2^  l/s^  ^4)  i^  ^^® 
plane  (1)  whose  coordinates,  together  with  a  certain  constant  p, 
satisfy  (4).  If  )0  =  0,  this  shows  that  the  quadric  is  a  cone  with  y 
as  a  vertex,  and  hence  that  (1)  is  at  least  a  pseudo-tangent  plane. 
If  /o  =?t  0,  equations  (4)  show  us  that  the  polar  plane  (3)  of  y  coin- 
cides with  the  plane  (1).  Moreover  we  see,  either  geometrically,  or 
by  multiplying  equations  (4)  by  ^/j,  1/2,  3/3,  ^/^  respectively  and  add- 
ing, that  the  point  ?/  lies  on  the  quadric ;  so  that,  in  this  case,  (1)  is 
a  true  tangent  plane. 

We  have  thus  established  the  theorem : 

Theorem  1.  Equation  (6)  is  a  necessary  and  sufficient  condition 
that  the  plane  (1)  he  tangent  to  the  quadric  (2). 

It  will  be  seen  that  this  theorem  gives  us  no  means  of  distinguish- 
ing between  true  and  pseudo-tangent  planes  of  quadric  cones.  In 
the  case  of  non-singular  quadrics,  pseudo-tangent  planes  are  impos- 
sible, and  therefore  equation  (6)  may,  in  this  case,  be  regarded  as  the 
equation  of  the  quadric  in  plane-coordinates. 

In  the  case  of  a  quadric  surface  of  rank  3,  that  is,  of  a  cone  with 
a  single  vertex,  the  coordinates  {u^,  U2->  %,  u^  of  every  plane  through 
this  vertex  satisfy  equation  (6),  so  that  in  this  case  this  equation 
represents  a  single  point,  and  not  the  quadric  cone.* 


*  111  fact  a  cone  cannot  be  represented  by  a  single  equation  in  plane-coordinates. 
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If  the  rank  of  (2)  is  less  than  3,  the  coordinates  of  every  plane 
in  space  should  satisfy  (6),  since  every  such  plane  passes  through  a 
vertex  and  is  therefore  a  tangent  plane.  This  fact  may  be  verified 
by  noticing  that  equation  (6)  may  also  be  written 

4 

where  the  J.'s  are  the  cofactors  in  the  discriminant  of  (2)  according 
to  our  usual  notation. 

We   pass  now  to  the  condition  that  a  straight  line   touch   the 
quadric  (2).     This  line  we  will  determine  as  the  intersection  of  the 
two  planes  (1)  and 
(7)  v^x-^  H-  v^x^  4-  v^x^  +  v^x^  =  0. 

If  the  line  of  intersection  of  these  planes  is  a  true  tangent  to  (2), 
there  will  be  a  point  {y^^  y^^^  y^,  y^^  namely  the  point  of  contact,  lying 
upon  it,  and  such  that  its  polar  plane  (3)  contains  the  line.  It  must  there- 
fore be  possible  to  write  the  equation  of  this  polar  plane  in  the  form 


(8) 


2;(/iMj  +  vv>)Xi=  0; 


(9) 


and,  in  fact,  by  properly  choosing  the  constants  /a  and  v^  the  co- 
efficients of  (8)  may  be  made  not  merely  proportional,  but  equal  to 
the  coefficients  of  (3) : 

^21^1  +  ^22^2  +  ^23^3  +  ^24^4  "  M^2  "  ^^2=  ^' 

^31^1  +  ^32^2  +  ^33^3  +  ^34^4"  /^%  "  ^^3  =  ^' 

L  ^41^1  +  «42^2  +  ^43^3  +  ^44^4  "  /^^4  "  ^^4  =  ^• 

Since  the  point  y  lies  on  the  line  of  intersection  of  the  planes  (1) 
and  (7),  we  also  have  the  relations 

/-j^Qv  pl^l  +  ^2^2  +  %^3  +  ^4^4  =  ^' 

I  HVl  +  ^2^2  +  ^3^3  +  ^43/4  =  ^• 

Since  the  six  equations  (9)  and  (10)  are  satisfied  by  six  constants 
Vv  Vv  Vz''  ^4'  A^'  ^  ^^^  ^^^  zero,  we  infer  finally  the  relation 


(11) 


11 

21 

31 
41 


12 

22 


32 


42 


U^ 


Uc 


a 
a 
a 
a 


13 
23 
33 
43 


14 
24 
34 
44 


U, 


0 
0 


Vi 


=  0. 
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We  have  deduced  this  equation  on  the  supposition  that  the  line  of 
intersection  of  (1)  and  (7)  is  a  true  tangent  to  (2).  We  leave  it  to 
the  reader  to  show  that  (11)  holds  if  this  line  is  a  pseudo-tangent, 
and  also  if  it  is  a  ruling  of  (2). 

We  also  leave  it  for  him  to  show  that  if  (11)  holds,  the  line  of 
intersection  of  (1)  and  (7)  will  be  either  a  true  tangent,  a  pseudo- 
tangent,  or  a  ruling,  and  thus  to  establish  the  theorem: 

Theorem  2.  A  necessary/  and  sufficient  condition  that  the  line  of 
intersection  of  the  planes  (1)  and  (7)  be  either  a  tangent  or  a  ruling 
of  (2)  is  that  equation  (11)  be  fulfilled. 

On  expanding  the  determinant  in  (11),  it  will  be  seen  that  it  is 
a  quadratic  form  in  the  six  line-coordinates  qij  (cf .  Exercise  3,  §  35). 
Equation  (11)  may  therefore  be  regarded  as  the  equation  of  the 
quadric  surface  in  line-coordinates  if  the  surface  is  not  a  cone,  or  is 
a  cone  with  a  single  vertex.  If  the  rank  of  (2)  is  2,  so  that  the 
quadric  consists  of  two  planes,  (11)  is  the  equation  of  the  line  of 
intersection  of  these  planes.  While  if  the  rank  is  1  or  0,  (11)  is 
identically  fulfilled. 

EXERCISES 

1.  Two  planes  are  said  to  be  conjugate  with  regard  to  a  non-singular  quadric 
surface  if  each  passes  through  the  pole  of  the  other. 

Prove  that  if  (2)  is  a  non-singular  quadric,  a  necessary  and  sufficient  con- 
dition that  the  planes  (1)  and  (7)  be  conjugate  with  regard  to  it  is  the  vanishing 
of  the  determinant 


a 


*11       "12  "13  "14  "] 

''21       ^22  ^23  ^24  ^i 

Igj       a^2  ^33  ^34  ^c 

41       ^42  ^43  ^44  ^^ 

Vo  Vo  V.  0 


4 


How  must  this  definition  of  conjugate  planes  be  extended  in  order  that  this 
theorem  be  true  for  singular  quadrics  also  ? 

2.  Prove  that  if  (2)  is  a  non-singular  quadric,  a  necessary  and  sufficient  con- 
dition that  the  point  of  intersection  of  three  planes  lie  on  (2)  is  the  vanishing  of 
the  seven-rowed  determinant  formed  by  bordering  the  discriminant  of  (2)  with  the 
coefficients  of  the  three  planes. 

3.  Admitting  it  to  be  obvious  geometrically  that  a  necessary  and  sufficient  con- 
dition that  a  line  touch  a  non-singular  quadric  is  that  the  two  tangent  planes  which 
can  be  passed  through  this  line  should  coincide,  prove  that,  if  (2)  is  non-singular, 
a  necessary  and  sufficient  condition  that  the  line  of  intersection  of  (1)  and  (7) 

touch  (2)  is  ,4   .  X  ^4  ^         X      /i  ^         \9     A 

(2,  AijUiUj)  (5  AijViVj)  -  (2  AijUiVjy  =  0. 

4.  Show  algebraically  that  the  condition  of  Exercise  3  is  equivalent  to  (11). 
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54.  The  Adjoint  Quadratic  Form  and  Other  Invariants.  Passing 
now  to  the  case  of  n  variables,  we  -begin  by  considering  the  system 
consisting  of  a  quadratic  form  and  a  single  linear  form 

n 

(1)  ^ai^x^x. 


ij-^t-^p 


n 

'2uiX(. 


(2) 

The  geometrical  considerations  of  the  last  section  suggest  that  we 
form  the  expression 


(3) 


^AijUiUj=- 


a, 


0 


This,  it  will  be  seen,  is  a  quadratic  form  in  the  variables  (wi,  •••  Un) 
whose  matrix  is  the  adjoint  of  the  matrix  of  (1).  We  will  speak  of 
(3)  as  the  adjoint  oi  (1). 

The  invariance  of  (3)  is  at  once  suggested  by  the  fact  that  in  the 
case  n  =  4:  the  vanishing  of  \3)  gave  a  necessary  and  sufficient  con- 
dition for  a  projective  relation.     In  fact  we  will  prove  the  theorem  : 

Theorem  1.  The  adjoint  form  (3)  is  an  invariant  of  weight  two  of 
the  pair  of  forms  (1),  (2). 

Inasmuch  as  the  u's  are,  as  we  saw  in  §  34,  contragredient  to  the 
a;'s,  we  may  also  call  (3)  a  contravariant  (cf .  Definition  2,  §  34). 

In  order  to  prove  this  theorem  we  must  subject  the  x's  to  a  linear 
transformation,  r      _        r    , ,         f 

(4) 


•^n  —  ^nl*^i     1    *"     \   ^nn'^nt 


whose  determinant  we  will  call  c.  Let  us  denote  by  a[j  and  u[  re- 
spectively the  coefficients  of  the  quadratic  and  linear  form  into  which 
this  transformation  carries  (1)  and  (2). 

Let  us  now  introduce  an  auxiliary  variable  ^,  and  consider  the 
quadratic  form  in  a^i,  •••  x^^  t^ 


(5) 


II, 

^aijXiXj  -{-2t  (uj^Xj^  -\ h  u^x^). 


The  discriminant  of  this  form    is  precisely  the  determinant  in 
(3),  that  is,  the  negative  of  the  adjoint  of  (1). 
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Let  us  now  perform  on  the  variables  iCj,  •••ir„,  ^  the  linear  trans- 
formation given  by  formulae  (4)  and  the  additional  formula 

(6)  t  =  t'. 

The  determinant  of  this  transformation  is  c,  and  it  carries  over  the 
form  (5)  into  \:    t    i  i     n  u    i  i  i    f\ 

From   the  fact  that  the  discriminant  of  (5)  is  an   invariant  of 
weight  2,  we  infer  the  relation  we  wished  to  obtain  : 


^11   *'*  ^In  ^1 


^nn  ^n 


U' 


0 


=  (?2 


a 


11 


^\n     ^i 


a 


n\ 


^nn     '^n 


U-, 


u„ 


0 


The  method  just  used  admits  of  immediate  extension  to  the  proof 
of  the  following  more  general  theorem : 

Theorem  2.  If  we  have  a  system  consisting  of  a  quadratic  form  in 
n  variables  and  p  linear  forms^  the  (n  -\-  p^-rowed  determinant  formed  hy 
bordering  the  discriminant  of  the  quadratic  form  by  p  roivs  and  p 
columns  each  of  which  consists  of  the  coefficients  of  one  of  the  linear 
forms  is  an  invariant  of  weight  2. 

We  leave  the  details  of  the  proof  of  this  theorem  to  the  reader. 

If  the  discriminant  a  of  the  quadratic  form  (1)  is  not  zero,  we  may 
form  a  new  quadratic  form  whose  matrix  is  the  inverse  of  the  matrix 
of  (1).  This  quadratic  form,  which  is  known  as  the  inverse  or 
reciprocal  of  (1),  is  simply  the  adjoint  of  (1)  divided  by  the  discrimi- 
nant a.     We  will  prove  the  following  theorem  concerning  it : 

Theorem  3.  If  the  quadratic  form  (1)  is  non-singular^  it  will  be 
carried  over  into  its  inverse  by  the  non-singular  transformation 

(7)  x'i  =  a^^x^  +  •••  +  ai^x^  (i=  1, 2, ... n). 

For  we  have  ^  =  4       r 

^  a^j  x^  Xj  —  ^  x^x^» 


But  from  (7)  we  have 

a 


+ 


-Av 


in  „f 


a 


and  therefore 


as  was  to  be  proved. 


^  a^jXj^Xj 


1    a 


E  — ^  X'iX. 
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It  will  be  noticed  that  if  (1)  is  a  real  quadratic  form,  the  trans- 
formation (7)  is  real ;  and  from  this  follows 

Theorem  4.    A  real  non-singular  quadratic  form  and  its  inverse 
have  the  same  signature. 

EXERCISES 

1.    Given   a  quadratic   form   "^aijXiXj  and   two   linear   forms    ^UiXi,  ^ViXi. 
Prove  that 


n 

2^  AijUiVj  ^ 


"^11  ^in 


a 


V^        "■      Vn        0 


is  an  invariant  of  the  system  of  weight  2. 

2.  Generalize  the  theorem  of  Exercise  1  to  the  case  in  which  we  have  more 
than  two  linear  forms. 

3.  Prove  that  if  a  first  quadratic  form  is  transformed  into  a  second  by  the 
linear  transformation  of  matrix  c,  then  the  adjoint  of  the  first  will  be  transformed 
into  the  adjoint  of  the  second  by  4:he  linear  transformation  whose  matrix  is  the 
conjugate  of  the  adjoint  of  c. 

4.  Prove  a  similar  theorem  for  bilinear  forms. 

5.  State  and  prove  a  theorem  for  bilinear  forms  analogous  to  Theorem  3. 

55.   The  Rank  of  the  Adjoint  Form.     Suppose  the  discriminant  a 

n 

of  the  quadratic  form  ^a^-x^Xj  is  of  rank  r,   and  that  the  discrimi- 

n 

nant  A  of  its  adjoint  2  AijU^Uj  is  of  rank  R,     Then,  if  r<?^—  1,  all 

the  {n  —  l)-rowed  determinants  of  a  are  zero ;  but  these  are  the  ele- 
ments of  A,  hence  ^=0.  If  r  =  ?^  —  1,  at  least  one  of  the  elements 
of  A  is  not  zero,  and  all  two-rowed  determinants  of  A  are  zero  (since 
by  §  11  each  of  them  contains  a  as  a  factor),  hence  R=l.  li  r=  n^ 
M  =  n;  for  if  R  were  less  than  n  we  should  have  ^  =  0,  and  there- 
fore a  =  0  (since  ^  =  a"*"!).  But  this  is  impossible,  since  by  hypothesis 
r  =  n.     We  have  then  : 

Theorem  1.  If  the  rank  of  a  quadratic  form  in  n  variables  and  of 
its  adjoint  are  r  and  R  respectively^  then 

if  r  =  n^  R  =  n^ 

if  r  =  n  —  1,  i2  =  l, 
if  r  <n—l,  E  =  0. 

M 
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Let  us  consider  further  the  case  r  =  n  —  l.     Here  we  have  seen 
that  11=1,  that  is,  that  the  adjoint  is  the  square  of  a  linear  form, 


71  n  n 


Comparing  coefficients,  we  see  that 


All  the  c's  cannot  be  zero,  as  otherwise  we  should  have  ^  =  0.  Let 
C),:=^0.     Then  since  ^    _^2^q 

we  see  that  not  all  the  quantities  {A^-^,  •••  JLx„)  are  zero.  Accord- 
ingly (cf.  §44)  the  point  {A^-^^  A^^^  •••  A^^^  and  therefore  also  the 
point  (cy,  •••  Cn)^  is  a  vertex  of  the  original  quadratic  form.  Thus  we 
have  the  theorem  : 

Theorem  2.  If  the  rank  of  a  quadratic  form  in  n  variables  is  n  —  \, 
its  adjoint  is  the  square  of  a  linear  form,  and  the  coefficients  of  this 
linear  form  are  the  coordinates  of  a  vertex  of  the  original  form. 

Since,  in  the  case  we  are  considering,  all  the  vertices  of  the 
quadratic  form  are  linearly  dependent  on  any  one,  this  theorem  com- 
pletely determines  the  linear  form  in  question  except  for  a  constant 
factor. 
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PAIRS  OF  QUADRATIC  FORMS 

56.  Pairs  of  Conies.  We  will  give  in  this  section  a  short  geomet- 
rical introduction  to  the  study  of  pairs  of  quadratic  forms,  confining 
ourselves,  for  the  sake  of  brevity,  to  two  dimensions. 

Let  u  and  v  be  two  conies  which  we  will  assume  to  be  so  situated 
that  they  intersect  in  four,  and  only  four,  distinct  points,  A^  B^  (7,  D. 
Consider  all  conies  through  these  four  points.     These  conies,  we  will 


say,  form  a  pencil.  It  is  obvious  that  there  are  three  and  only  three 
singular  conies  (i.e.  conies  which  consist  of  pairs  of  lines)  in  this 
pencil,  namely,  the  three  pairs  of  lines  AB,  CD ;  BC,  DA ;  AC,  BD. 
Let  us  call  the  "  vertices  "  of  these  conies  P,  Q^  and  R  respectively. 
From  the  harmonic  properties  of  the  complete  quadrilateral*  we 
see  that  the  secants  PAB  and  PCD  are  divided  harmonically  by  the 

*  Cf .  any  book  on  modern  geometry. 
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line  QR.  Accordingly  QR  is  the  polar  of  P  with  regard  to  every 
conic  of  the  pencil.  In  a  similar  manner  PR  is  the  polar  of  Q,  and 
PQ  the  polar  of  R  with  regard  to  every  conic  of  the  pencil.  Thus, 
we  see  that  the  triangle  PQR  is  a  self-conjugate  triangle  (see  §41) 
with  regard  to  every  conic  of  the  pencil.  Accordingly,  if  we  per- 
form a  collineation  which  carries  over  P,  Q^  R  into  the  origin  and 
the  points  at  infinity  on  the  axes  of  x  and  3/,  the  equation  of  every 
conic  of  the  pencil  will  be  reduced  to  a  form  in  which  only  the 
square  terms  enter.     We  are  thus  led  to  the  result : 

Theorem.  If  two  conies  intersect  in  four  and  only  four  distinct 
points^  there  exists  a  non-singular  collineation  which  reduces  their 
equations  to  the  normal  form 

A^xl  4-  A^xl  4-  A^xl  =  0, 
B^xl  +  b\xI  +  B^xl  =  0. 

If  we  wish  to  carry  through  this  reduction  analytically,  we  shall 
write  the  equations  of  the  two  conies  u  and  v  in  the  forms 

3  3 

(1)  ^ct^jXiXj  =  0,         ^hijXiXj  =  0. 

The  pencil  of  conies  may  then  be  written 

3 

(2)  t{%--^h)xiX^-  =  0, 

1 

or  rather,  to  be  accurate,  this  equation  will  represent  for  different 
values  of  \  all  the  conies  of  the  pencil  except  the  conic  v.  The 
singular  conies  of  the  pencil  will  be  obtained  by  equating  the 
discriminant  of  (2)  to  zero. 


(3) 


^11  ~^^11  ^12  ~"  ^^12 

6*0-1  \0n-i  ^92  ^~'  ^22  v*oo  ~~  A-CoQ     — ~  "• 

^31  ■"  ^^31  ^32  ~  ^^32 


^13 

-X5i3 

^23 

-^^23 

%3' 

-X^>33 

This   equation   we   will   call  the .  \-equation  of  the  two  conies. 
When  expanded,  it  takes  the  form 

(4)  -A'\3  +  @^X2-©X-|-A  =  0, 
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^11 

^12 

^13 

«11 

^12 

«13 

^1 

«12 

«13 

«21 

^22 

^23 

+ 

^21 

^22 

^23 

+ 

^21 

^2 

^23 

«31 

^32 

^33 

%1 

^32 

«33 

^31 

^32 

^33 

where  A,  A^  are  the  discriminants  of  u  and  v  respectively,  and 


@  = 


while  %'  can  be  obtained  from  @  by  an  interchange  of  the  letters  a 
and  h.  It  can  readily  be  proved  (cf.  the  next  section)  that  the  co- 
efficients @  and  %'  as  well  as  A  and  A'  are  invariants  of  weight  two. 

Except  when  the  discriminant  A'  of  v  is  zero,  the  equation  (4) 
is  of  the  third  degree,  and  its  three  roots,  which  in  the  case  we 
have  considered  must  evidently  be  distinct,  give,  when  substituted 
in  (2),  the  three  singular  conies  of  the  pencil. 

We  will  not  stop  here  to  show  how  the  theory  of  any  two 
conies,  where  no  restriction  as  to  the  number  of  points  of  intersec- 
tion is  made,  can  be  deduced  from  equation  (3).*  This  will  follow 
in  Chapter  XXII  as  an  application  of  the  method  of  elementary 
divisors.  Our  only  object  in  this  section  has  been  to  give  a  geo* 
metrical  basis  for  the  appreciation  of  the  following  sections. 

57.  Invariants  of  a  Pair  of  Quadratic  Forms.     Their  X-£quation. 

We  consider  the  pair  of  quadratic  forms 

n 

n 

and  form  from  them  the  pencil  of  quadratic  forms 

n 
^  -  Xl/r  =  S(a,y  -  Xbij)XiXj. 

The  discriminant  of  this  pencil, 


^m  ~  ^^«l  '  •  •  ^nn  ~~  ^^T 


^FiX), 


is  a  polynomial  in  \  which  is  in  general  of  degree  n,  and  which  may 
be  written  j^(^^  =  ©^  -  ©^X  -}-•••  +  (  -  IfS.W 

*  An  elementary  discussion  of  the  X-equation  of  two  conies  {Vequation  en  X)  is 
regularly  given  in  French  text-books  on  analytic  geometry.  See,  for  instance,  Briot 
et  Bouquet,  Legons  de  Geometrie  analytique,  14th  ed.,  p.  349,  or  Niewenglowski,  Cours 
de  Geometrie  anaJytiqne,  Vol.  I,  p.  459. 
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The  coefficients  of  this  polynomial  are  themselves  polynomials  in 
the  a^/s  and  5^/s,  ©o  and  (s)„  being  merely  the  discriminants  of  </>  and 
yjr  respectively,  while  ©^^  is  the  sum  of  all  the  different  determinants 
which  can  be  formed  by  replacing  k  columns  of  the  discriminant  of 
0  by  the  corresponding  columns  of  the  discriminant  of  yjr. 

Theorem  1.  The  coefficients  O^,  ••  •  0^  o/  F{X)  are  integral  rational 
invariants  of  weight  two  of  the  pair  of  quadratic  forms  0,  i/r.* 

In  order  to  prove  this,  let  us  consider  a  linear  transformation  of 
determinant  c  which  carries  over  <^  and  i/r  into  </>'  and  i^'  respec- 
tively, where  n 

n 

Let  us  denote  by  0J  the  polynomial  in  the  a^'s  and  J-^'s  obtained  by 
putting  accents  to  the  a's  and  J's  in  @^-.  Our  theorem  will  then  be 
proved  if  we  can  establish  the  identities 

0J  =  c20,  (^  =  0,l,  ..•^) 

This  follows  at  once  from  the  fact  that  -F(X),  being  the  discriminant 
of  </)  —  X-^/r,  is  an  invariant  of  weight  two,  so  that  if  we  denote  by 
F'{X)  the  discriminant  of  </>'  —  Xi/r',  we  have 

F\X)  =  c'^  F{X). 

This  being  an  identity  in  X  as  well  as  in  the  a'^  and  5's,  we  can 
equate  the  coefficients  of  like  powers  of  X  on  the  two  sides,  and  this 
gives  precisely  the  identities  we  wished  to  establish.  | 
The  equation  F(\^  =  Q 

we  will  call  the  X-equation  of  the  pair  of  forms  <^,  -^/r.  Since,  as  we 
have  seen,  F  is  merely  multiplied  by  a  constant  different  from  zero 
when  (/)  and  -v/r  are  subjected  to  a  non-singular  linear  transformation, 

*  Cf.  Exercise  13,  §  90. 

\  The  method  by  which  we  have  here  arrived  at  invariants  of  the  system,  of  two 
quadratic  forms  will  be  seen  to  be  of  very  general  application.  If  we  have  an  integral 
rational  invariant  I  of  weight  /*  of  a  single  form  of  the  A:th  degree  in  n  variables,  we  can 
find  a  large  number  of  invariants  of  the  system  0i,  02,  •••  <t>p  of  p  forms  of  the  A:th  degree 
in  n  variables  by  forming  the  invariant  /for  the  form  Xi0i  +  •••  +\p<t>p-  This  will  be 
a  polymonial  in  the  X's,  each  of  whose  coefficients  is  seen,  precisely  as  above,  to  be  an 
integral  rational  invariant  of  the  systems  of  0's  of  weight  /*• 
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the  roots  of  the  X-equation  will  not  be  changed  by  such  a  transfor- 
mation. These  roots,  however,  are  irrational  functions  of  the  ©'s  and 
hence  of  the  a's  and  5's.     We  may  therefore  state  the  result : 

Theorem  2.  The  roots  of  the  \-equation  of  a  pair  of  quadratic 
forms  are  absolute  irrational  invariants  of  this  pair  of  forms  with 
regard  to  non- singular  linear  transformations. 

It  is  clear  that  the  multiplicity  of  any  root  of  the  X-equation 
will  not  be  changed  by  a  non-singular  linear  transformation.    Hence 

Theorem  3.  The  multiplicities  of  the  roots  of  the  \-equation  are 
arithmetical  invariants  of  the  pair  of  quadratic  forms  with  regard  to 
non-singular  linear  transformations. 

If  </>  =  a^x\+  ...  +  a^xl, 

^=     x\-\-  •"  -\-  x^ 


'n't 


the  roots  of  the  X-equation  are  a^,  ••.  a^.  This  example  shows 
that  the  absolute  invariants  of  Theorem  2  may  have  any  values, 
and  also  that  the  arithmetical  invariants  of  Theorem  3  are  subject 
to  no  other  restriction  than  the  obvious  one  of  being  positive  in- 
tegers whose  sum  is  n. 

58.  Reduction  to  Normal  Form  when  the  \-Equation  has  no  Multi- 
ple Roots.  Although  our  main  concern  in  this  section  is  with  the 
case  in  which  the  X-equation  has  no  multiple  roots,  we  begin  by  estab- 
lishing a  theorem  which  applies  to  a  much  more  general  case. 

Theorem  1.  If  \^  is  a  simple  root  of  the  \- equation  of  the  pair 
(f),  yjr  of  quadratic  forms  in  n  variables.,  then.,  by  a  non-singular  linear 
transformation.,  </>  and  yjr  can  be  reduced  respectively  to  the  forms 

where  c^  is  a  constant  not  zero  and  <^j,  yfr^  are  quadratic  forms  in  the 
n—  1  variables  z^.,  •  •  •  z,^. 

To  prove  this,  we  will  consider  the  pencil  of  forms 
(2)  (f)-\ylr  =  cf>-  X^^fr  -f  (X^  -  \)yjr. 
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Since  X^  is  a  root  of  the  \-equation  of  the  pair  of  forms  </>,  i/r,  the  form 
(f)^\^|r  is  singular,  and  can  therefore,  by  a  suitable  non-singular  linear 
transformation,  be  written  in  a  form  in  which  one  of  the  variables,  say 
x[,  does  not  enter,  ^  -  \^ ,/r  =  </)'(4,  .  • .  x^^). 

If  this  transformation  reduces  yjr  to  yjr^  we  have 

(3)  (^  - X^  =  (/>'(4,  --x',)  +  (\  -  X)ylr\x[,  . . .  x',). 

The  discriminant  of  the  quadratic  form  which  stands  here  on  the 
right  cannot  contain  X^  —  \  as  a  factor  more  than  once,  since  Xj  is,  by 
hypothesis,  not  a  multiple  root  of  the  X-equation  of  <^  and  -i/r.  It 
follows  from  this  that  the  coefficient  of  x'^  in  the  quadratic  form  -i/r' 
cannot  be  zero,  for  otherwise  the  discriminant  of  the  right-hand  side 
of  (3)  would  have  a  zero  in  the  upper  left-hand  corner,  and  Xj  —  X 
would  be  a  factor  of  all  the  elements  of  its  first  row  and  also  of  its 
first  column  ;  so  that  it  would  contain  the  factor  (Xj  —  X)^. 

Since  the  coefficient  of  x'^  in  yjr'  is  not  zero,  we  can  by  Lagrange's 
reduction  (Formulae  (2),  (3),  §45)  obtain  a  non-singular  linear  trans- 
formation of  the  form 

h  =  7i4  +  72^  +  •••  +  7n4 

h 


Zrn  —  Xn 


which  reduces  yjr'  to  the  form 

This  transformation  carries  over  the  second  member  of  (3)  into 

<^X^2'  •••^n) +  (^1-^)^1  (^2'  •••«») +  (Xi-X)ci4 
Combining  these  two  linear  transformations  and  writing 

we  have  thus  obtained  a  non-singular  linear  transformation  which 
effects  the  reduction, 

<^(^r  •••^n)-^'^(^li-"^n)  =  <^l(^2'"-2^«)-^'fl(^2^---^n)4-(Xi-X)<?i2;f. 

If,  here,  we  equate  the  coefficients  of  X  on  both  sides,  and  the 
terms  independent  of  X,  we  see  that  we  have  precisely  the  reduc- 
tion of  the  forms  </>,  ^  to  the  forms  (1);  and  the  theorem  is  proved. 
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Let  us  now  assume  that  the  form  i/r  is  non-singular,  thus  insur- 
ing that  the  \-equation  be  of  degree  n.  We  will  further  assume 
that  the  roots  Xj,  Xg'  •"  ^n  of  this  equation  are  all  distinct.  We  can 
then,  by  the  theorem  just  proved,  reduce  the  forms  <^,  yjr  to  the  forms 
(1)  by  a  non-singular  linear  transformation.  The  \-equation  of  these 
two  forms  is  seen  to  differ  from  the  X-equation  of  the  pair  of  forms 
in  (n  —  1)  variables  c^^,  yjr^  only  by  the  presence  of  the  extra  factor 
Xj— \.  Accordingly  the  X-equation  of  the  pair  of  forms  <^j,  yjr^  has 
as  its  roots  Xg,  •••  X„  and  these  are  all  simple  roots.  We  may  there- 
fore apply  the  reduction  of  Theorem  1  to  the  two  forms  </)j,  yjr^  and 
thus  by  a  non-singular  linear  transformation  of  z^^  •••  z^  reduce  them 
to  the  forms  ^2  ^2  4^  +  02(4'  "*  ^»)' 

^24^  +  ^^2(4'  •••4)- 

This  linear  transformation  may,  by  means  of  the  additional  formula 

be  regarded  as  a  non-singular  linear  transformation  of  2j,  •  •  •  s„  which 
carries  over  </>,  \jr  into  the  fot*ms 

\e^z'^  -f  X2C242  -f  (^2(4'  •••  4> 

^i4^+    ^24^ +'^2(4'  •••4> 

Proceeding  in  this  way,  we  establish  the  theorem: 

Theorem  2.  If  cji^yjr  are  quadratic  forms  in  (a^j,  •  •  •  x^)  of  which  the 
second  is  non-singular^  and  if  the  roots  X^,  •  ••  X^  of  their  \-equation  are  all 
distinct^  there  exists  a  non- singular  linear  transformation  which  carries 
over  <l>  and  ^  into      ^^ ^^^^2  +  x^c^x'^  +  •  •  •  -f  X„c„4^ 

/•     '>«'2      I  /»     T  A       I         .  .  .        I  />     /y'it 

11      '  ^2     2"^  ^        ^n'^n 

respectively^  where  c^^---  c^  are  constants  all  different  from  zero. 

Since  none  of  the  c's  are  zero,  the  linear  transformation 

x^=V^x!,  (^  =  l,  2,  ...w) 

is  non-singular.     Performing  this  transformation,  we  get  the  further 
result: 

Theorem  3.  Under  the  same  conditions  as  in  Theorem  2,  <f>  and 
yjr  may  he  reduced  by  means  of  a  non-singular  linear  transformation  to 
the  normal  forms  „^^2  +  x^rri  +  ...  +  X„xl, 

x\Jr      xl+  ■■•  +      xl- 
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From  this  we  infer  at  once 

Theorem  4.  If  in  the  two  pairs  of  quadratic  forms  <^,  ^/r  and  cj^'^ 
yjr'  the  forms  yjr  and  'yjr'  are  both  non-singular^  and  if  the  X-equations  of 
these  two  pairs  of  forms  have  no  multiple  roots^  a  necessary  and  suf- 
ficient condition  for  the  equivalence  of  the  two  pairs  of  forms  is  that 
these  two  \-equations  have  the  same  roots  ;  or,  what  amounts  to  the  same 
thing,  that  the  invariants  O^,  @ j,  •••©„(?/  the  first  pair  of  forms  he  pro- 
portional to  the  invariants  @q,  @j,  •  •  •  @Jj  of  the  second. 

EXERCISE 

Prove  that,  under  the  conditions  of  Theorem  3,  the  reduction  to  the  normal 
form  can  be  performed  in  essentially  only  one  way,  the  only  possible  variation 
consisting  in  a  change  of  sign  of  some  of  the  x's  in  the  normal  form. 

59.  Reduction  to  Normal  Form  when  \|/  is  Definite  and  Non- 
singular.  We  now  consider  the  case  of  two  real  quadratic  forms 
<^,  i/r  of  which  i/r  is  definite  and  non-singular.  Our  main  problem  is 
to  reduce  this  pair  of  forms  to  a  normal  form  by  means  of  a  real 
linear  transformation.     For  this  purpose  we  begin  by  proving 

Theorem  1.  The  X-equation  of  a  pair  of  real  quadratic  forms 
(f>,  yfr  can  have  no  imaginary  root  if  the  form  yjr  is  definite  a7id  non- 
singular. 

For,  if  possible,  let  a  -f-  /3^  {a  and  ^  real)  be  an  imaginary  root  of 
this  X-equation,  so  that  fi  =^0.  Then  (fy  —  ayjr  —  i^-KJr  will  be  a  singular 
quadratic  form,  and  can  therefore  be  reduced  by  a  non-singular 
linear  transformation 

^i  =  (Pll  +  mi)  ^1+  '"  +  (Pln  +  mn)  ^n. 


^n  =  {Pnl  +  ^^nX)^l  +   ••*  +  {Pnn  +  iqnn)^n 

to  the  sum  of  h  squares,  where  k  <  n, 

(1)  <|>-a^|r-i|3^jr  =  x[^^\-x!^^-\-  "'  -{-x'k^. 

Let 

(2)  I/l=Pn^l-^  •"  +Pln^n^ 

(3)  zi  =  qi^x^  4-  •••  -\-qin^n^ 
so  that  x'l  =  yi+  izi . 
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By  equating  the  coefficients  of  i  on  the  two  sides  of  (1)  we  thus  get 
(4)  -/3i/r  =  2^1^14- 2^2^2  4-  ••• +2y,2,. 

Let  us  now  determine  x^,  •••  Xn  so  as  to  make  the  right-hand  side 
of  (4)  vanish,  for  instance  by  means  of  the  equations 

A  reference  to  (2)  shows  that  we  have  here  a  system  of  k  real 
homogeneous  linear  equations  in  n  unknowns,  so  that  real  values  of 
x^,  •"  i?;^not  all  zero  can  be  found  satisfying  these  equations.  For 
these  values  of  the  variables,  we  see  from  (4)  that  yjr  vanishes ;  but 
this  is  impossible  (cf .  the  Corollary  of  Theorem  3,  §  52),  since  yjr  is  by 
hypothesis  non-singular  and  definite. 

Theorem  2.  If  '^  is  a  non-singular  definite  quadratic  form  and  <j> 
is  any  real  quadratic  form^  the  pair  of  forms  0,  -v/r  can  he  reduced  hy  a 
real  non-singular  linear  transformation  to  the  normal  form 


(5) 


where  \j,  •  •  •  X„  are  the  roots  of  the  X-equation^  and  the  upper  or  lower 
sign  is  to  he  used  in  hoth  cases  according  as  yjr  is  a  positive  or  a  negative 
form. 

The  proof  of  this  theorem  is  very  similar  to  the  proof  of 
Theorem  2,  §  58.  We  must  first  prove,  as  in  Theorem  1,  §  58, 
that  </),  yjr  can  be  reduced  by  a  real  non-singular  linear  transforma- 
tion to  the  forms  r 

(6)  (^1=^^) 

To  prove  this,  we  consider  the  pencil  of  forms 

Since  X^  is  real  by  Theorem  1,  <^  —  X^i/r  is  a  real  singular  quad- 
ratic form,  and  can  therefore  by  a  real  non-singular  linear  trans- 
formation be  reduced  to  a  form  in  which  one  of  the  variables  does 
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If  this  transformation  reduces  yjr  to  yjr',  we  have 

(7)  <l>-^f  =  <l>\4."-<)  +  (\-xH\^'v"^l)' 

At  this  point  comes  the  essential  difference  between  the  case 
we  are  now  considering  and  the  case  considered  in  §  58,  as  \  may 
now  be  a  multiple  root  of  the  discriminant  of  the  right-hand  side 
of  (7).  We  need,  then,  a  diff'erent  method  for  showing  that  the 
coefficient  of  x'^  in  yjr'  is  not  zero.  For  this  purpose  it  is  sufficient 
to  notice  that  -v/r,  and  therefore  also  yfr^  is  a  non-singular  definite 
form,  and  that  accordingly,  by  Theorem  4,  §  52,  the  coefficient  of 
none  of  the  square  terms  in  yjr'  can  be  zero. 

Having  thus  shown  that  the  coefficient  of  x'^  in  yjr'  is  not  zero, 
we  can  apply  Lagrange's  reduction  to  yjr'^  and  thus  complete  the 
reduction  of  the  forms  </>,  i/r  to  the  forms  (6)  precisely  as  in  the  proof 
of  Theorem  1,  §  58,  noticing  that  the  transformation  we  have  to  deal 
with  is  real. 

In  (6),  <^i,  -v/tj  are  real  quadratic  forms  in  the  n  —  1  variables 
^2,  •••  ^n'     Moreover,  since 

is  non-singular  and  definite,  it  follows  that  the  same  is  true  of  ^fr^ 
For,  if  yJTi  were  either  singular  or  indefinite,  we  could  find  values 
of  ^2'  •  •  •  z^  not  all  zero  and  such  that  '\jr^  =  0;  and  these  values  to- 
gether with  the  value  Sj  =  0  would  make  '\/r  =  0.  This,  however,  is 
impossible  by  the  Corollary  of  Theorem  3,  §  52. 

The  X-equation  of  the  two  forms  <^j,  yjr^  evidently  differs  from 
the  \-equation  of  <^,  i/r  only  by  the  absence  of  the  factor  X  —  \.  The 
roots  of  the  \-equation  of  <^j,  yjr^  are  therefore  Xg^  *•*  ^ni  so  that  if  we 
reduce  (fy^  and  yjr^  by  the  method  already  used  for  <^,  yjr  (we  have  just 
seen  that  <j>^,  yjr^  satisfy  all  the  conditions  imposed  on  <^,  i/r),  we  get 

<^l(^2'  '•'Zn)  =  \  ^2^2^  +  4>2(4^  ' ' '  ^«> 
V^l(^2'  "'^n)=        ^2^2^  +  ^2(4'  •  •  •  ^n)' 

Proceeding  in  this  way,  we  finally  reduce  <^,  -^  by  a  real  non-singu- 
lar linear  transformation  to  the  forms 


ill 


l/r=        C^^l-\-'-'+  C^yl 
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Since  yjr  is  definite,  the  constants  c^,  •  •  •  c„  are  all  positive  or  all  nega- 
tive according  as  -^/r  is  a  positive  or  a  negative  form.  By  means  of 
the  further  non-singular  real  linear  transformation 

^'i  =  ^[^\l/i  (^=l,  2,  ...n), 

the  forms  (8)  may  be  reduced  to  the  forms  (5),  and  our  theorem  is 
proved. 

EXERCISES 

1.  If  <^  is  a  real  quadratic  form  in  n  variables  of  rank  r,  prove  that  it  can  be 
reduced  by  a  real  orthogonal  transformation  in  n  variables  to  the  form 

CjXl  +  C^xl-\ hCrXl. 

Cf .  Exercises,  §  52. 

2.  Show  that  the  determinant  of  the  orthogonal  transformation  of  Exercise  1 
may  be  taken  at  pleasure  as  +  1  or  —  1. 

3.  Discuss  the  metrical  classification  of  real  quadric  surfaces  along  the 
following  lines: 

Assume  the  equation  in  non-homogeneous  rectangular  coordinates,  and  show 
that  by  a  transformation  to  another  system  of  rectangular  coordinates  having  the 
same  origin  the  equation  can  be  reduced  to  a  form  where  the  terms  of  the  second 
degree  have  one  or  the  other  of  the  five  forms  (the  A 's  being  positive  constants) 


A. 

1^1 

■^A, 

A-^ 

A.. 

2 

A 

2 
1^1 

+  ^2 

<xl  — 

Ae 

2 

A, 

-VA, 

2 
}^2> 

A 

2 
1^1 

-^. 

J  ^25 

A 

2 
iXi. 

Then  simplify  each  of  the  non-homogeneous  equations  thus  obtained  by  further 
transformations  of  coordinates;  thus  getting  finally  the  standard  forms  of  the 
equations  of  ellipsoids,  hyperboloids,  paraboloids,  cones,  cylinders,  and  planes 
■which  are  discussed  in  all  elementary  text-books  of  solid  analytic  geometry. 


CHAPTER  XIV 

SOME   PROPERTIES   OF   POLYNOMIALS   IN   GENERAL 

60.  Factors  and  Reducibility.  In  the  present  section  we  will 
introduce  certain  conceptions  of  fundamental  importance  in  our 
subsequent  work. 

Definition  1.  By  a  factor  or  divisor  of  a  polynomial  f  in  any 
number  of  variables  is  understood  a  polynomial  cj)  which  satisfies  an 
identity  of  the  form  ^  _   »  • 

-i/r  being  also  a  polynomial. 

It  will  be  noticed  that  every  constant  different  from  zero  is  a 
factor  of  every  polynomial ;  that  every  polynomial  is  a  factor  of  a 
polynomial  which  vanishes  identically ,  while  a  polynomial  which 
is  a  mere  constant,  different  from  zero,  has  no  factors  other  than 
constants. 

We  note  also  that  a  polynomial  in  a;^,  ••  •  x^  which  is  not  identically 
zero  cannot  have  as  a  factor  a  polynomial  which  actually  contains 
any  other  variables. 

The  conception  of  reducibility,  which  we  have  already  met  in 
a  special  case  (§47),  we  define  as  follows: 

Definition  2.  A  polynomial  is  said  to  be  reducible  if  it  is  iden- 
tically equal  to  the  product  of  two  polynomials  neither  of  which  is  a 
constant. 

In  dealing  with  real  polynomials,  a  narrower  determination  of 
the  conception  of  reducibility  is  usually  desirable.  We  consider, 
then,  what  we  will  call  reducibility  in  the  domain  of  reals.,  a  con- 
ception which  we  define  as  follows: 

Definition  3.  A  real  polynomial  is  said  to  be  reducible  in  the 
domain  of  reals  if  it  is  identically  equal  to  the  product  of  two  other 
real  polynomials  neither  of  which  is  a  constant. 

174 
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In  many  branches  of  algebra  still  another  modification  of  the 
conception  of  reducibility  plays  an  important  part.  In  order  to 
explain  this,  we  first  lay  down  the  following  definition: 

Definition  4.  A  set  of  numbers  is  said  to  form  a  domain  of 
rationality  if  when  a  and  h  are  any  numbers  of  the  set,  a  -\-  b, 
a—b,  ab,  and,  so  far  as  5^0,  a/b  are  also  numbers  of  the  set. 

Thus  all  numbers,  real  and  imaginary,  form  a  domain  of  ration- 
ality, and  the  same  is  true  of  all  real  numbers.  The  simplest  of  all 
domains  of  rationality,  apart  from  the  one  which  contains  only  the 
single  number  zero,  is  what  is  known  as  the  nataral  domain,  that  is  all 
rational  numbers,  positive  and  negative.  A  more  complicated  domain 
of  rationality  would  be  the  one  consisting  of  all  numbers  of  the  form 
a  +  5  V  —  1,  where  a  and  b  are  not  merely  real,  but  rational.  These 
illustrations,  which  might  be  multiplied  indefinitely,  should  suffice  to 
make  the  scope  of  the  above  definition  clear.* 

Definition  5.  A  polynomial  all  of  whose  coefficients  lie  in  a 
domain  of  rationality  R  is  said  to  be  reducible  in  this  domain  if  it  is 
identically  equal  to  a  product  of  two  polynomials,  neither  of  which  is  a 
constant,  whose  coefficients  also  lie  in  this  domain. 

It  will  be  noticed  that  Definitions  2  and  3  are  merely  the  special 
cases  of  this  definition  in  which  the  domain  of  rationality  is  the 
domain  of  all  numbers,  and  the  domain  of  all  reals  respectively.  To 
illustrate  these  three  definitions,  we  note  that  the  polynomial  x^-\-l 
is  reducible  according  to  Definition  2,  since  it  is  identically  equal  to 
(x  +  V  — 1)  (x—  V  —  1).  It  is,  however,  not  reducible  in  the  domain 
of  reals,  nor  in  the  natural  domain.  On  the  other  hand,  x^  —  2  is 
reducible  in  the  domain  of  reals,  but  not  in  the  natural  domain. 
Finally,  x^  —  4:  is  reducible  in  the  natural  domain. 

Leaving  these  modifications  of  the  conception  of  reducibility,  we 
close  this  section  with  the  following  two  definitions  : 

Definition  6.  Two  polynomials  are  said  to  be  relatively  prime  if 
they  have  no  common  factor  other  than  a  constant. 

*By  i?  (ai,a2)  •••  «n)  is  understood  the  domain  of  rationality  consisting  of  all 
numbers  which  can  be  obtained  from  the  given  numbers  a\,  •♦•  an  by  the  rational  pro- 
cesses (addition,  subtraction,  multiplication,  and  division).  In  this  notation  the  natural 
domain  would  be  most  simply  denoted  by7?(l);  the  domain  last  mentioned  in  the 
text  by  jR  (1,  V  — l)or,  even  more  simply,  by  i?  (V  — 1).  This  notation  would  not  apply 
to  all  cases  {e.g.  the  real  domain)  except  by  the  use  of  an  infinite  number  of  arguments. 
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Definition  7.  Two  methods  of  factoring  a  polynomial  shall  he 
said  to  he  not  essentially  different  if  there  are  the  same  numher  n  of 
factors  in  each  case^  and  these  factors  can  he  so  arranged  that  the  kth 
factors  are  proportional  for  all  values  of  k^  from  1  to  n  inclusive. 

EXERCISES 

1.  Prove  that  every  polynomial  in  (x,  y)  is  irreducible  if  it  is  of  the  form 

/(^)  +  y, 

where  /  (x)  is  a  polynomial  in  x  alone. 

Would  this  also  be  true  for  polynomials  of  the  form 

/(x)+2/2? 

2.  lif,<f>,{l/  are  polynomials  in  any  number  of  variables  which  satisfy  the 
relation  f=<i>^, 

and  if  the  coefficients  of  /and  <f>  lie  in  a  certain  domain  of  rationality,  prove  that 
the  coefficients  of  if/  will  lie  in  the  same  domain  provided  (f>  ^0. 

61.  The  Irreducibility  of  the  General  Determinant  and  of  the  Sym- 
metrical Determinant. 

Theorem  1.     The  determinant 


D  = 


^21  ^22  ***  ^2« 


^nl  ^n2  '"  ^nn 


is  an  irreducihle  polynomial  if  its  n^  elements  are  regarded  as  inde- 
pendent var'tahles. 


For  suppose  it  were  reducible,  and  let 


where  neither/ nor  <^  is  a  constant.  Expanding  D  according  to  the 
elements  of  the  first  row,  we  see  that  it  is  of  the  first  degree  in  a^. 
Hence  one  of  the  two  polynomials  /  and  <f>  must  be  of  the  first 
degree  in  a^j,  the  other  of  the  zeroth  degree.  Precisely  the  same 
reasoning  shows  that  if  a,^  is  any  element  of  2>,  one  of  the  polyno- 
mials /  and  </>  will  be  of  the  first  degree  in  a,;^-,  while  the  other  will 
not  involve  this  variable. 

Let  us  denote  by  /  that  one  of  the  two  polynomials  which  involves 
a^i,  any  element  of  the  principal  diagonal  of  D.     Then  <^  does  not 
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involve  any  element  of  the  ith  row  or  the  ^th  column.  For  if  it  did, 
since/  is  of  the  first  degree  in  a^  and  (f)  is  of  the  zeroth,  their  product 
D  would  involve  terms  containing  products  of  the  form  a^^aij  or 
a^aji,  which,  from  the  definition  of  a  determinant,  is  impossible. 
Consequently,  if  either  one  of  the  polynomials  /  and  <^  contains  any 
element  of  the  principal  diagonal  of  2),  it  must  contain  all  the  ele- 
ments standing  in  the  same  row  and  all  those  standing  in  the  same 
column  with  this  one,  and  none  of  these  can  occur  in  the  other 
polynomial. 

Now  suppose/ contains  %  and  that  cf)  contains  any  other  element 
of  the  principal  diagonal,  say  ajj.  Then  a^j  and  aji  can  be  in  neither/ 
nor  <^,  which  is  impossible.  Hence,  if /contains  any  one  of  the  ele- 
ments in  the  principal  diagonal,  it  must  contain  all  the  others,  and 
hence  all  the  elements,  and  our  theorem  is  proved. 

Theorem  2.      The  symmetrical  determinant 


I)  = 


a 


11 


a 


in 


a 


«i 


a 


(%  =  %i) 


is  an  irreducible  polynomial  if  its  ^n(n-\-Y)  elements  he  regarded  as 
independent  variables. 

The  proof  given  for  the  last  theorem  holds,  almost  word  for  word, 
in  this  case  also,  the  only  difference  being  that  while  D  is  of  the  first 
degree  in  each  of  the  elements  of  its  principal  diagonal,  it  is  of  the 
second  degree  in  each  of  the  other  elements.  The  slight  changes  in 
the  proof  made  necessary  by  this  difference  are  left  to  the  reader. 


EXERCISES 

1.   The  general  bordered  determinant 

flu  ...  ai„     1/11  '"  u\p 


Cln\  .  .  .  Ctfin     ^nl  •  •  •  ^np 

yii  . . .  vi„        0  ...  0 


Vpi...  Vpn         0  ...  0 

is  irreducible  if  />  <  n,  the  a's,  w's,  and  v's  being  regarded  as  independent  variables. 

2.   The  symmetrical  bordered  determinant  obtained  from  the  determinant  in 
Exercise  1  by  letting  a<j-  =  a,-,-,  Uij  =  v^-,-  is  irreducible  if  ^  <  n. 

N 
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3.    If  for  certain  values  of  i  and  J,  but  not  for  all,  Qij  =  Qji,  but  if  the  a's  are 
otherwise  independent,  can  we  still  say  that 

Oil  .  . .  Cl\n 


is  irreducible  ? 


«„i 


4.    Prove  that  a  skew-symmetric  determinant  (cf.  Exercises  2,  3,  §20)  is  always 
reducible  by  showing  that,  when  it  is  of  even  order,  it  is  a  perfect  square. 
[Suggestion.     Use  Corollary  3,  §  11,  and  Theorem  6  and  Exercise  1,  §  76.] 
Does  this  theorem  require  any  modification  if  the  elements  are  real  and  we 
consider  reducibility  in  the  domain  of  reals  ? 

62.  Corresponding  Homogeneous  and  Non-Homogeneous  Polyno- 
mials. It  is  often  convenient  to  consider  side  by  side  two  polyno- 
mials, one  homogeneous  and  the  other  non-homogeneous,  which 
bear  to  one  another  the  same  relation  as  the  first  members  of  the 
equations  of  a  plane  curve  or  of  a  surface  in  homogeneous  and  non- 
homogeneous  coordinates  respectively.  Such  polynomials  we  will 
speak  of  as  corresponding  to  one  another  according  to  the  following 
definition: 

Defhstition.  If  we  have  a  non-homogeneous  polynomial  of  the  kth 
degree  in  any  number  of  variables  {x-^^  •••^w-i)  and  form  a  new  poly- 
nomial by  multiplying  each  term  of  the  old  by  the  power  of  a  new 
variable  Xn  necessary  to  bring  up  the  degree  of  this  term  to  k^  the  homo- 
geneous polynomial  thus  formed  shall  be  said  to  correspond  to  the  given 
non-homogeneous  polynomial. 

Thus  the  two  polynomials 

(1)  2x^  +  ^x^y-bxz^^yz  +2z^  -^rx    -  3y    -9, 

(2)  2a;3  +  3rrV  -^xz^-  yzt  -f  2zH  -f  xt^  -  %yt^  -  9^, 

correspond  to  each  other. 

It  may  be  noticed  that  if  <^  {xy,  •  •  •  x^--^  is  the  non-homogeneous 
polynomial  of  degree  A?,  the  corresponding  homogeneous  polynomial 
may  be  written  /^     ^         ^ 


'n     "^n 


To  every  non-homogeneous  polynomial  there  corresponds  one,  and 
only  one,  homogeneous  polynomial.  Conversely,  however,  to  a 
homogeneous  polynomial  in  n  variables  there  correspond  in  general 
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n  different  non-homogeneous  polynomials  which  are  obtained  by  set- 
ting one  of  the  variables  equal  to  1.  For  instance,  in  the  example 
given  above,  to  (2)  corresponds  not  only  (1)  but  also 

(3)  2     +3^     -5^2  _y^t^1zH-\-t^  _3^f2_9^3^ 

(4)  2:1:3  _|_  3^2    _5^22_2;^    j^^^H^xt^-U^   -9^^ 

(5)  1:i^^-^x^y-bx    -yt   +2^     +^^2.3^^2  _  9^3. 

It  should  be  noticed,  however,  that  if  one  of  the  variables  enters  into 
every  term  of  a  homogeneous  polynomial,  the  result  of  setting  this 
variable  equal  to  unity  is  to  give,  not  a  corresponding  non-homo- 
geneous polynomial,  but  a  polynomial  of  lower  degree.  In  fact,  in 
the  extreme  case  in  which  every  variable  enters  into  every  term  of 
the  homogeneous  polynomial,  there  is  no  corresponding  non-homo- 
geneous polynomial;  as,  for  instance,  in  the  case  of  the  polynomial 

x^yz  -j-  xy'^z  -f  xyz'^. 

Theorem  1.  If  one  of  two  corresponding  polynomials  is  reducible^ 
then  the  other  is^  also^  and  the  factors  of  each  polynomial  correspond  to 
the  factors  of  the  other. 

For  let  <i>{x-^^  " '  x^^  be  a  homogeneous  polynomial  of  degree 
(k  4-  Z),  and  suppose  it  can  be  factored  into  two  factors  of  degrees 
h  and  Z,  respectively. 

Now  suppose  the  corresponding  non-homogeneous  polynomial  in 
question  is  the  one  formed  by  setting  Xn  =  1.     We  have 

Since  by  hypothesis  the  degree  of  the  polynomial  on  the  left  is 
unchanged  by  this  operation,  neither  of  the  factors  on  the  right- 
hand  side  of  (6)  can  have  had  its  degree  reduced,  hence  neither 
of  the  factors  on  the  right  of  (7)  is  a  constant.  Our  non-homo- 
geneous polynomial  is  therefore  reducible  ;  and  moreover  the  two 
factors  on  the  right  of  (7),  being  of  degrees  k  and  I  respectively, 
are  precisely  the  two  functions  corresponding  to  the  two  factors  on 
the  right  of  (6). 
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Now,  let  ^k+i{^v  '"  ^n-i)  l>e  ^  non-homogeneous  polynomial,  and 

where  the  subscripts  denote  the  degrees  of  the  polynomials.  Let 
4^k+i'>  '^k+h  Xk+i  b^  ^^^  homogeneous  polynomials  corresponding  to 
4),  ^,  X.     Then  when  x^  ^  0, 

Multiplying  this  equation  by  a;*^'  we  have 

an  equation  which  holds  whenever  Xn  ^  0,  and,  therefore,  by  Theorem 
5,  §  2,  is  an  identity.     Thus  our  theorem  is  proved. 

As  a  simple  illustration  of  the  way  in  which  this  theorem  may  be 
applied  we  mention  the  condition  for  reducibility  of  a  non-homo- 
geneous quadratic  polynomial  in  any  number  of  variables.  By 
applying  the  test  of  §  47  to  the  corresponding  homogeneous  poly- 
nomial we  obtain  at  once  a  test  for  the  reducibility  of  any  non- 
homogeneous  quadratic  polynomial. 

Theorem  2.  Iff  and  (f)  are  non-homogeneous  polynomials^  and 
-F,  <l>  are  the  corresponding  homogeneous  polynomials^  a  necessary  and 
sufficient  condition  that  F  and  ^  he  relatively  prime  is  that  f  and 
(j)  be  relatively  prime. 

For  iff  and  ^  have  a  common  factor  yjr  which  is  not  a  constant, 
the  homogeneous  polynomial  ^  which  corresponds  to  yjr  is,  by 
Theorem  1,  a  common  factor  of  JF  and  O,  and  is  clearly  not  a  con- 
stant. Conversely,  if  '^  is  a  common  factor  of  F  and  ^  which  is 
not  a  constant, /and  </>  will  have,  by  Theorem  1,  a  common  factor 
which  corresponds  to  "^  and  which  therefore  cannot  be  a  mere 
constant. 

63.  Division  of  Polynomials.  We  will  consider  first  two  polyno- 
mials in  one  variable: 


(1) 


f{x)  =  a^x""  +  a^a;^-!  +  -"-ha^, 
<^  (x)  =  h^x"^  -}-  h^x"^-^  +  •  •  •  +  *m. 
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We  learn  in  elementary  algebra  how  to  divide/ by  <^,  getting  a 
quotient  Q  (x)  and  a  remainder  M  (a;).  What  is  essential  here  is 
contained  in  the  following  theorem : 

Theorem  1.  If  f  and  </>  are  two  polynomials  in  x  of  which  <f)  is 
not  identically  zero^  there  exists  one^  and  only  one,  pair  of  polynomials, 
Q  and  R,  which  satisfy  the  identity 

(2)  f{x)=Q{x)<\>{x)  +  B{x), 

and  such  that  either  lt=0,*  or  the  degree  of  R  is  less  than  the  degree  of(f). 

We  begin  by  proving  that  at  least  one  pair  of  polynomials  Q,  R 
exists  which  satisfies  the  conditions  of  the  theorem. 

If/ is  of  lower  degree  than  </>  (or  if/=0),  the  truth  of  this  state- 
ment is  obvious,  for  we  may  then  let  Q  =  0,  R  =/. 

Suppose,  then,  that /is  of  at  least  as  high  degree  as  (j).  Writing 
/  and  (f)  in  the  form  (1),  we  may  assume 

a^^O,      ^0  ^  0,     n> m. 

Lemma.  If  (j>  is  not  of  higher  degree  thanf  there  exist  two  polyno- 
mials Qi  and  R^  which  satisfy  the  identity 

f(x)=Q,{x)<t>{x)-hR,{x), 
and  such  that  either  i^j  =  0,  or  the  degree  of  R^  is  less  than  the  degree  off 
The  truth  of  this  lemma  is  obvious  if  we  let 

Oq 

These  two  polynomials  Q^  and  R^  will  serve  as  the  polynomials  Q 
and  R  of  our  theorem  if  i^j=0,  or  if  the  degree  of  R^  is  less  than  the 
degree  of  <^.  If  not,  apply  the  lemma  again  to  the  two  functions  R-^ 
and  4,,  getting  ^^  ^^^  ^  q^  ^^^  ^  ^^^  ^  ^^  ^^^^ 

where  R^  is  either  identically  zero  or  is  of  lower  degree  than  R^,  We 
may  then  write,     f^^^  ^  ^  q^  (^^  ^  q^  ^^y^  ^  (^^  ^  ^^  (^^ 

If  R^  =  0,  or  if  the  degree  of  R^  is  less  than  the  degree  of  (/>,  we  may 
take  for  the  polynomials  Q,  R  of  our  theorem,  Q^  -\-  Q^diud  R^.  If 
not,  we  apply  our  lemma  again  to  R^  and  0.     Proceeding  in  this  way 

*  It  will  be  remembered  that,  according  to  the  definition  we  have  adopted,  a  polyno- 
mial which  vanishes  identically  has  no  degree. 
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we  get  a  series  of  polynomials  R^^  R^^,  •••  whose  degrees  are  con- 
stantly decreasing.  We  therefore,  after  a  certain  number  of  steps, 
reach  a  polynomial  Ri  which  is  either  identically  zero  or  of  degree 
less  than  <^.     Combining  the  identities  obtained  up  to  this  point,  we 

l^ave  y  (^)  =  [  (2j  (^)  +  . . .  +  (J.  {x)  ^<i>{x)  +  R,  {x), 

an  identity  which  proves  the  part  of  our  theorem  which  states  that 
at  least  one  pair  of  polynomials  Q,  R  of  the  kind  described  exists.* 

Suppose  now  that  besides  the  polynomials   Q,  R  of  the  theorem 
there  existed  a  second  pair  of  polynomials  Q\  R'  satisfying  the  same 
conditions.     Subtracting   from    (2)   the  similar   identity   involving 
§',  R\  we  have 
(3)  0  =  (Q-Q')(l>-h(R-R'). 

From  this  we  infer,  as  was  to  be  proved, 

Q=Q',  R  =  R', 

For  if  Q  and  Q^  were  not  identical,  the  first  term  on  the  right  of  (3) 
would  be  of  at  least  the  mth  degree,  while  the  second  involves  no 
power  of  X  as  high  as  m. 

Turning  now  to  polynomials  in  several  variables : 

<f>(x^,  '•'Xk)  =  h^lx^,  . . .  Xj^x'l  +  h-^{x^,  ' ' .  a:^>f -1  H h bj^x^,  •  •  •  a;^), 

the  ordinary  method  of  dividing/  by  <^  would  give  us  as  quotient  and 
remainder,  not  polynomials,  but  fractional  rational  functions.  In 
order  to  avoid  this,  we  state  our  theorem  in  the  following  form  : 

Theorem  2.  If  f  and  </>  are  polynomials  in  {x^^  •••  x^)  of  ivhich  <j) 
is  not  identically/  zero,  there  exist  polynomials  Q,  R,  P,  of  which  the 
last  is  not  identically  zero  and  does  not  involve  the  variable  x-^^  which 
satisfy  the  identity, 

(5)     P{x^,  '"  Xj,)f{x^,  .-.  Xk)  =  Q(x^,  •••  x/^)(f){x^,  •••  X/^)  +  R(x^,  •••  x/^), 

and  such  that  either  R  =  0,  or  the  degree  in  x^of  R  is  less  than  the  degree 
in  x^  of  (j). 

The  proof  of  this  theorem  follows  the  same  lines  as  the  proof 
of  Theorem  1. 

*  The  reader  should  notice  that  the  process  just  used  is  merely  the  ordinary  process 
of  long  division. 
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If/  is  of  lower  degree  in  x^  than  (j>  (or  if /=  0),  the  truth  of  the 
theorem  is  obvious,  for  we  may  then  let  P  =  1,  §  =  0,  ^  =/. 

Suppose,  then,  that  /  is  of  at  least  as  high  degree  in  x^  as  <^. 
Writing /and  </>  in  the  form  (4),  we  may  assume 

t^Q  ^  0,     5q  ^  0,     n>  m. 

Lemma.  If  <f)  is  not  of  higher  degree  in  x^  than  f  there  exist  two 
polynomials  §j,  R^  which  satisfy  the  identity^ 

and  such  that  either  R^  =  0,  or  the  degree  of  R^  in  x-^  is  less  than  the 
degree  offin  Xy 

The  truth  of  this  lemma  is  obvious  if  we  let 

The  polynomials  Q^  R^  h^  will  serve  as  the  polynomials  Q.  i2,  P 
of  our  theorem  if  R^  =  0,  or  if  the  degree  of  R^  in  x-^  is  less  than  tlie 
degree  of  <f>  in  x^  If  not,  apply  the  lemma  again  to  the  two  functions 
R^  and  </>,  getting 

^o(^2'   •••  ^k)^l{^V   •••  ^k)=Q2(^V   '"  ^k)^{^V    '••  ^k)+^2(.^V   '•'  ^k\ 

where  R^  is  either  identically  zero  or  is  of  lower  degree  in  x-^  than  R^ 
We  may  then  write      j^y  _  ^j^^^  ^  q^^^  ^  j^^ 

If  7^2  —  ^'  ^^  i^  ^^®  degree  of  Rc^  in  x^  is  less  than  the  degree  of  <f>  in 
iCj,  we  may  take  for  the  polynomials  Q^  R,  P  of  our  theorem  the 
functions  h^Q^-\-  Q^^  R^^  5^.  If  not,  we  apply  our  lemma  again  to  R^ 
and  <^.  Proceeding  in  this  way,  we  get  a  series  of  polynomials 
Ry  i?2'  •••  whose  degrees  in  x^  are  constantly  decreasing.  We  there- 
fore, after  a  certain  number  of  steps,  reach  a  polynomial  Ri  which  is 
either  identically  zero,  or  of  degree  in  x-^  less  than  </>.  Combining 
the  identities  obtained  up  to  this  point,  we  have 

an  identity  which  proves  our  theorem,  and  which  also  establishes  the 
additional  result : 

CoROLLATiY.     The  polynomial  P  whose  existence  is  stated  in  our 
theorem  may  he  taken  as  a  power  of  5q. 
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We  note  that  it  would  obviously  not  be  correct  to  add  to  the 
statement  of  Theorem  2  the  further  statement  that  there  is  only 
one  set  of  polynomials  §,  i2,  P,  since  the  identity  (5)  may  be  multi- 
plied by  any  polynomial  in  (x^^  ■-•^n)  without  changing  its  form. 
Cf.,  however,  the  exercise  at  the  end  of  §  73.    . 

64.  A  Special  Transformation  of  a  Polynomial.  Suppose  that 
/(irj,a725:/:^3,^4)  is  a  homogeneous  polynomial  of  the  kth  degree  in 
the  homogeneous  coordinates  {x^,  x^^  x^,  x^^  so  that  the  equation 
f  =  0  represents  a  surface  of  the  A?th  degree.  If,  in  /,  the  term  in 
x^  has  the  coefficient  zero,  the  surface  passes  through  the  origin; 
and  if  the  term  in  x^  (or  x^-,  or  x^)  has  the  coefficient  zero,  the 
surface  passes  through  the  point  at  infinity  on  the  axis  of  a;^ 
(or  a^g,  01'  ^3)-  It  is  clear  that  these  peculiarities  of  the  surface 
can  be  avoided,  and  that,  too,  in  an  infinite  variety  of  ways,  by  sub- 
jecting the  surface  to  a  non-singular  collineation  which  carries  over 
any  four  non-complanar  points,  no  one  of  which  lies  on  the  surface, 
into  the  origin  and  the  three  points  at  infinity  on  the  coordinate 
axes.  It  is  this  fact,  generalized  to  the  case  of  n  variables,  which 
we  now  proceed  to  prove. 

Lemma.  If  f{x^^  "'^n)  ^^  ^  homogeneous  polynomial  of  the  kth 
degree  in  which  the  term  a;f„  is  wanting^  there  exists  a  non-singular 
linear  transformation  of  the  variables  (2;^,  •••  a;„)  which  carries  f  into  a 
new  form  /j,  in  which  the  term  in  xj^  has  a  coefficient  different  from 
zerOf  while  the  coefficients  of  the  kth  powers  of  the  other  variables  have 
not  been  changed. 

In  proving  this  theorem  there  is  obviously  no  real  loss  of  gener- 
ality in  taking  as  the  variable  2;^  the  last  of  the  variables  x^^. 
Let  us  then  consider  the  non-singular  transformation 

Xi^x[-\-aixl  {i=l,  "'  n  —  1). 


^n  — -^n 


This  transformation  carries  /  into 
and   evidently  does   not  change   the   coefficients   of   the   terms   in 

•^1?  •*'n-  1* 
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Now,  since  every  term  in/j,  except  the  term  in  x'^-,  contains  at  least 
one  of  the  variables  x'^,  •  •  •  icj,  _  j,  the  coefficient  of  the  term  in  x'l  will 

be 

/i(0,  •••  0,  1)  =  /(«!,  •••a^_i,  1). 

Our  lemma  will  therefore  be  proved  if  we  can  show  that  the 
constants  a^,  •••  a^_  j  can  be  so  chosen  that  this  quantity  is  not  zero. 

Let  us  take  any  point  (5j,  •  •  •  ^„)  for  which  h^^O  \  and  consider 
a  neighborhood  of  this  point  sufficiently  small  so  that  x^  does 
not  vanish  at  any  point  in  this  neighborhood.  Then,  since  / 
does  not  vanish  identically,  we  can  find  a  point  ((?i,  •••  e^)  in  this 
neighborhood  (so  that  e„^0)  such  that 

/(ci,  ...c„)^0. 

If  now  we  take  for  a^,  •••a„_j  the  values  c^/ c^-,  •••<?„-i/^»i  we  shall 
have,  since /is  homogeneous, 

/(«i,  •••««_!,  1)  T^O. 
Thus  our  lemma  is  provecl. 

Theorem  1.    If  f{x^^  •••^n)  *'^  ^  homogeneous  polynomial  of  the  kth 
degree^  there  exists  a  non-singular  linear  transformation  which  carries 
f  into  a  new  form  f^  in  which  the  terms  in  x\,  •••  x'^  all  have  coefficients 
different  from  zero. 

The  proof  of  this  theorem  follows  at  once  from  the  preceding' 
lemma.  For  we  need  merely  to  perform  in  succession  the  trans- 
formations which  cause  the  coefficients  first  of  a:^,  then  of  x^^  etc., 
to  become  different  from  zero,  and  which  our  lemma  assures  us 
will  exist  and  be  non-singular,  to  obtain  the  transformation  we 
want.  To  make  sure  of  this  it  is  necessary  merely  to  notice  that 
the  coefficient  of  x\  obtained  by  the  first  transformation  will  not 
be  changed  by  the  subsequent  transformations;  that  the  same 
will  be  true  of  the  coefficient  of  x^  obtained  by  the  second  trans- 
formation ;  etc. 

Theorem  2.  If  f{x-^.,  •••  x^  is  a  polynomial  of  the  kth  degree 
which  is  not  necessarily  homogeneous,  there  exists  a  non-singular 
homogeneous  linear  transformation  of  (rr^,  •••  x^  which  makes  this 
polynomial  of  the  kth  degree  in  each  of  the  variables  x[,  •••  x'^  taken 
separately. 
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If  /  is  homogeneous,  this  is  equivalent  to  Theorem  1.  If  /  is 
non-homogeneous,  we  may  write  it  in  the  form 

where  each  </>  is  a  homogeneous  polynomial  of  the  degree  indicated 
by  its  subscript  or  else  is  identically  zero.  We  need  now  merely  to 
apply  Theorem  1  to  the  homogeneous  polynomial  <f)j^,  which  is,  of 
course,  not  identically  zero. 

This  theorem,  and  therefore  also  Theorem  1,  which  is  merely  a 
special  case  of  it,  admits  the  following  generalization  to  the  case  of  a 
system  of  functions : 

Theorem  3.    If  we  have  a  si/ stem  of  polynomials 

of  degrees  k^^  h^^  •••  h^  respectively^  there  exists  a  non-singular  homo- 
geneous linear  transformation  which  makes  these  polynomials  of  degrees 
^v  "'  ^m  ^^  each  of  the  variables  x[,  •  •  •  x'^  taken  separately. 

This  theorem  may  be  proved  either  by  the  same  method  used  in 
proving  Theorems  1,  2 ;  or  by  applying  Theorem  2  to  the  product 

/1/2  '"Jm" 


CHAPTER   XV 

FACTORS  AND   COMMON   FACTORS  OF  POLYNOMIALS   IN  ONE 
VARIABLE   AND   OF   BINARY   FORMS 

65.  Fundamental  Theorems  on  the  Factoring  of  Polynomials  in 
One  Variable  and  of  Binary  Forms.  Theorem  2,  §  6  may  be  stated 
in  the  following  form  : 

Theorem  1.  A  polynomial  of  the  nth  degree  in  one  variable  is 
always  reducible  when  n'>\.  It  can  be  resolved  irito  the  product  of  n 
linear  factors  in  one,  and  essentially  in  only  one,  way. 

By  means  of  §  62  we  can  deduce  from  this  a  similar  theorem  in 
the  case  of  the  binary  form 

(  J-  )  aryV-i    ~f"   a-tX-t        Xn   "f"     •  •  •     ~J~    a^Xnt 

Let  us  first  assume  that  a^^^O.     Then  the  non-homogeneous  poly- 
nomial 

(2)  a^xl '\- a^xl-^  +  ••.  -t-  a„ 

corresponds  to  (1),  according  to  the  definition  of  §  62.     Factoring 

(2),  we  get 

or,  if  we  take  n  constants  a'l,  Wg '  *  *  *  "«  whose  product  is  «q, 

(3)  {a'lx^  -  aj)«2:i  -  4)  •••  (««^i  -  «« )' 

where  for  brevity  we  have  written 

aJJai  —  a\  (^  =  1,  2,  •  •  •  71). 

By  Theorem  1,  §  62,  we  now  infer  that  the  binary  form  (1)  is  identi- 
cally equal  to 

(4)  {a!{x^  -  ai^2)("2^i  -  "2^2)  •••  ("n^i  -  "«^2)- 
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Moreover,  there  cannot  be  any  way  essentially  different  from  this  of 
factoring  (1)  into  linear  factors,  for  if  there  were  we  should,  by 
setting  x^  =  1,  have  a  way  of  factoring  (2)  into  linear  factors  essen- 
tially different  from  (3).  Thus  our  theorem  is  proved  on  the  suppo- 
sition that  a^  =^  0. 

Turning  now  to  the  case  a^  =  0,  let  us  suppose  that 

a^  =     ...     =    a;t-l    =    0,    (X^  :^  0, 

where  k  <n.     The  form  (1)  then  has  the  form 
(5)  %a:;f-^+   ...  -f  a,^a:^, 

which  is  equal  to  the  product  of  k  factors  a^g  and  the  binary  form 

of  degree  n  —  k.  Since  the  first  coefficient  in  this  form  is  not  zero, 
it  can,  as  we  have  just  seen,  be  factored  into  n  —  h  linear  factors. 
Thus,  here  also,  we  see  that  the  binary  form  can  be  written  in  the 
form  (4),  the  only  peculiarity  being  that  in  this  case  h  of  the  con- 
stants a"  are  zero.  We  leave  it  to  the  reader  to  show  that  this 
factoring  can  be  performed  in  essentially  only  one  way.  This  being 
done,  we  have  the  result : 

Theorem  2.  A  binary  form  of  the  nth  degree  is  always  reduc- 
ible when  n  >  1.  It  can  be  resolved  into  the  product  of  n  linear 
factors  in  one^  and  essentially  only  one,  way, 

EXERCISES 

1.  Prove  that  every  real  polynomial  in  one  variable  of  degree  higher  than  two 
is  reducible  in  the  domain  of  reals,  and  can  be  resolved  into  irreducible  factors  in 
one,  and  essentially  only  one,  way. 

2.  Prove  the  corresponding  theorem  for  real  binary  forms. 

66.   The  Greatest   Common  Divisor  of  Positive  Integers.*      We 

will  consider  in  this  section  the  problem  of  finding  the  greatest  com- 
mon divisor  of  two  positive  integers  a  and  5,  which  has  the  closest 

*In  this  section  we  use  the  term  divisor  in  the  arithmetical  sense,  not  in  the 
algebraic  sense  defined  in  §  60.  An  integer  b  is  said  to  be  a  divisor  of  an  integer  a  if 
an  integer  c  exists  such  that  a  =  be. 
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analogy  with  the  algebraic  problem  of  the  next  section.     The  solu- 
tion of  this  problem,  which  was  given  by  Euclid,  is  as  follows : 

If  we  divide  ahy  b*  and  get  a  quotient  q^  and  a  remainder  r^,  we 
may  write  a  =  q,b  +  r^, 

where,  if  the  division  is  carried  out  as  far  as  possible,  we  have  r^<b. 
Then  divide  5  by  r  getting  a  quotient  q^  and  a  remainder  r^ 
which,  if  the  division  is  carried  out  as  far  as  possible,  is  less  than  r^. 
Proceeding  in  this  way,  we  get  the  following  system  of  equations, 
in  which,  since  the  remainders  r^,  rg,  •••  are  positive  integers  which 
continually  decrease,  we  ultimately  come  to  a  point  where  the  divi- 
sion leaves  no  remainder  : 

a  =  90^+  ri  r^  ^  h, 

b  =  q^r^  +  r2  ^^^  ^^^ 

^1  =  ^2^2  +  ^3 


(1) 


3  ^  '2' 


^p-2  ~"  5'p-i^p-r+  ^p  ^p  ^  ^p-i' 

Tp-i  =  9p^p  ^  <  ^P- 

From  the  first  of  these  equations  we  see  that  every  common  factor 
of  a  and  5  is  a  factor  of  r^;  from  the  second,  that  every  common 
factor  of  b  and  r^  is  a  factor  of  r^;  etc.;  finally,  that  every  common 
factor  of  rp_2  and  rp_j  is  a  factor  of  Vp.  Hence  everi/  common  factor 
of  a  and  b  is  a  factor  of  r^. 

On  the  other  hand,  we  see  from  the  last  equation  (1)  that  every 
factor  of  rp  is  a  factor  of  rp.^ ;  from  the  next  to  the  last  equation,  that 
every  common  factor  of  rp  and  rp_^  is  a  factor  of  rp_2;  etc.;  finally, 
that  every  common  factor  of  r^  and  r-^  is  a  factor  of  5,  and  that 
every  common  factor  of  r^  and  5  is  a  factor  of  a.  Hence  every  factor 
of  rp  is  a  common  factor  of  a  and  b. 

Since  the  largest  factor  of  rp  is  rp  itself,  we  have  the  result  : 

Theorem  1.  In  Euclid's  algorithm  (1),  the  greatest  common  divisor 
of  a  and  b  is  rp. 

In  particular,  a  necessary  and  sufficient  condition  that  a  and  b  be 
relatively  prime  is  that  rp  =  1. 

*  This  is  possible  even  if  a  <  6,  the  only  peculiarity  in  this  case  being  that  the 
quotient  is  zero  and  the  remainder  equal  to  a. 
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We  will  next  deduce  from  the  equations  (l)an  important  formula 
by  means  of  which  Vp  is  expressed  in  terms  of  a,  5,  and  the  ^'s. 
From  the  first  equation  (1)  we  have 

Substituting  this  value  into  the  second  equation,  we  get  for  r^ 
the  value  r^=^q^a-h {q^q^  +  1)^. 

Substituting  the  values  for  r^  and  r^  just  found  in  the  third 
equation,  we  get    ^^  ^  (^^^^  +  i^  ^  _  (^^^^^^  +  ^^  +  ^^^j. 

Proceeding  in  this  way,  we  can  express  each  of  the  r's,  and  there- 
fore ultimately  rp,  in  terms  of  a  and  h.  In  order  to  express  con- 
veniently the  general  formula  here,  we  introduce  the  following 
notation  : 

[«i]  =  «i' 


It  will  be  seen  that  the  values  of  r^,  r^,  r^  found  above  are  included 
in  the  formula 

By  the  method  of  mathematical  induction  this  formula  will  therefore 
be  established  for  all  values  oi  k^p  if,  assuming  that  it  holds  when 
k<k-^^  we  can  show  that  it  holds  when  k  =  Jc^-\-l.  This  follows  at 
once  when,  in  the  formula 

we  substitute  for  rj^^  and  rk^_^  their  values  from  (3)  and  reduce  the 
resulting  expression  by  means  of  the  definitions  (2). 
We  have  therefore  established  the  formula 

(4)  rp  =  Aa-{-  Bh, 

where      ^  =  (  -  If-^  [^p  q^,-"  qp.{\ ,    ^  =  (  -  1 )"  [^O'  ^r  •  *  •  ^p-i]  • 

Since  the  ^''s  are  integers,  it  is  clear  that  A  and  B  will  be  integers. 
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The  most  important  application  of  the  result  just  obtained  is  to 
the  case  in  which  a  and  h  are  relatively  prime.     Here  Vp  =  1,  and  we 
have 
(5)  Aa  +  Bb  =  l. 

Conversely,  if  two  integers  A  and  B  exist  which  satisfy  (5),  a  and  b 
are  relatively  prime,  as  otherwise  the  left-hand  side  of  (5)  would 
have  a  factor  greater  than  1. 

Theorem  2.  A  necessary/  and  sufficient  condition  for  a  and  h  to  he 
relatively  prime  is  that  there  exist  two  integers  A  and  B  such  that 
Aa  +  Bh  =  1. 

EXERCISES 

1.  Prove  that  [ai,a2,  •••  «n]  =  [«mCt«-i,  •••  ai]- 
[Suggestion.     Use  the  method  of  mathematical  induction.] 

2.  Prove  that  the  numerical  values  of  the  integers  A  and  B  found  above  are 
respectively  less  than  ^b  and  |a. 

[Suggestion.  Show  that  a/b  =[^o' •••?p]/[?i' *•' ?p]>  ^^^  *hat  this  second 
fraction  is  expressed  in  its  lowest  terms.] 

3.  Prove  that  there  can  exist  only  one  pair  of  integers  A  and  B  satisfying  the 
relation  Aa  -{-  Bb  =  1  and  such  that  A  and  B  are  numerically  less  than  ^b  and  ^a 
respectively. 

67.  The  Greatest  Common  Divisor  of  Two  Polynomials  in  One 
Variable.  In  place  of  the  integers  a  and  b  of  the  last  section,  we 
consider  here  the  two  polynomials : 


(1) 


By  the  greatest  common  divisor  of  these  two  polynomials  is 
meant  their  common  factor  of  greatest  degree.*  It  will  turn  out  in 
the  course  of  our  work  that  (except  in  the  case  in  which /and  </>  are 
both  identically  zero)  this  greatest  common  divisor  is  completely 
determinate  except  for  an  arbitrary  constant  factor  which  may  be 
introduced  into  it. 

*  Many  English  and  American  text-books  use  the  term  highest  common  factor ;  but 
as  there  is  not  the  slightest  possibility  that  the  word  greatest,  here,  should  refer  to  the 
value  of  the  polynomial,  since  the  polynomial  has  an  infinite  number  of  values  for  dif- 
ferent values  of  the  argument,  it  seems  better  to  retain  the  traditional  term. 
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We  will  assume  that  neither/ nor  <^  is  a  mere  constant,  and  that 
the  notation  has  been  so  introduced  that  /  is  of  at  least  as  high 
degree  as  </> ;  that  is,  we  assume 

^OTtO,     60^0,     n>m>0. 

Let  us  apply  Euclid's  algorithm  to/  and  <^  precisely  as  in  §  66  we 
applied  it  to  a  and  h.     We  thus  get  the  system  of  identities 

^  {x)  =  Qlx)R^{x)  -f  R^x), 
(2)  \     Rlx)=Qlx)Rlx)^Elx), 


I  R,.^{x)  =  Qlx)Rlx)  +  J?,+i. 

For  the  sake  of  uniformity  we  will  write 

<i>{x)  =  R^^(x), 

Then  R^^  R^^  ^2^  **•  ^^'^  polynomials  of  decreasing  degrees,  so  that 
after  a  finite  number  of  steps  a  remainder  is  reached  which  is  a  con- 
stant.    This  remainder  we  have  indicated  by  i^p+j. 

From  this  algorithm  we  infer,  as  in  §  66^  that  every  common 
factor  of  /  and  (/>  is  a  factor  of  all  the  iZ's,  and,  on  the  other  hand, 
that  every  common  factor  of  two  successive  R's  is  a  factor  of  all  the 
preceding  .^"s  and  therefore  of  /  and  </>.  Accordingly,  if  /  and  (/> 
have  a  common  factor  which  is  not  a  constant,  this  common  factor 
must  be  a  factor  of  the  constant  Rp^^^  and  therefore  Rp^i  =  0.  Con- 
versely, if  J?p^j=  0,  the  polynomial  Rp(x)  is  itself  a  common  factor 
of  Rp  and  Rp+i^  and  therefore  of /and  <^.     Hence  the  two  theorems : 

Theorem  1.  A  necessary  and  sufficient  condition  that  two  poly- 
nomials in  one  variable  f  and  </>,  neither  of  which  is  a  constant^  he 
relatively  prime  is  that  in  Euclid'' s  algorithm^  (2),  Rp+i  =#=  0. 

Theorem  2.  If  in  Euclid's  algorithm,  (2),  i^^+j  =  0,  then  Rp(x)  is 
the  greatest  common  divisor  off  and  (f). 

By  means  of  this  theorem  we  are  in  a  position  to  compute  the 
greatest  common  divisor,  not  only  of  two,  but  of  any  finite  number,  of 
polynomials  in  one  variable.  Thus  if  we  want  the  greatest  common 
divisor  of  f(x),  4>{^)^  '^(x),  we  should  first  compute,  as  above,  the 
greatest  common  divisor  Rp{x)  of  /  and  <^,  and  then,  by  the  same 
method,  compute  the  greatest  common  divisor  of  Rp(x)  and  '\lr(x). 
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From  the  identities  (2)  we  can  compute  the  value  of  each  of  tlie 
remainders  in  terms  of  /,  <^,  and  the  quotients  Q.  The  formulae 
here  are  precisely  like  those  of  §  66,  and  give  for  Bp+^  the  value 

(3)  M,^,  ^  (  _  1  )p  [  ^^  (x),  Qlx),  .  • .  Qlx)]f{x) 

+  (-1^1  IQlx),  Qlx),  ...  Q,{x)-]<i>{x). 

Suppose,  now,  that  /  and  <^  are  relatively  prime.  We  may  then 
divide  (3)  by  Rp^^  and  get 

(4)  F{x)f{x)  +  ^{x)<i>{x)  =  1, 

( -  ly 


where 
(5) 


F{x)^ 


n 


p+i 


(_i\p+i 


B 


p+l      L 


QX^) 


QX^) 


From  the  definitions  (2),  §  66,  we  see  that  F  and  ^  are  polyno- 
mials in  X.  The  existence  of  two  polynomials  F  and  ^  which  satisfy 
(4)  is  therefore  a  necessary  condition  that/  and  (f)  be  relatively  prime. 
It  is  also  a  sufficient  condition  ;  for  from  (4)  we  see  that  every  com- 
mon factor  of /and  <f>  must  be  a  factor  of  1,  that  is,  must  be  a  con- 
stant.    Thus  we  have  proved  the  theorem: 

Theorem  3.  A  necessary  and  sufficient  condition  that  the  polyno- 
mials f(x)  and  (t>(x)  be  relatively/  prime  is  that  two  polynomials  F{x)  and 
^{x)  exist  which  satisfy  (4).* 

We  can  make  this  statement  a  little  more  precise  by  noting  the 
degrees  of  F  and  <^  as  given  by  (5).  For  this  purpose  let  us  first 
notice  that  if  «j,  •  •  ♦  %  are  polynomials  of  degrees  h^,  •  •  •  k^  respec- 
tively, [«j,  ...  «„]  will,  by  (2),  §  66,  not  be  of  degree  greater  than 
^1  +  •  •  •  +  ^«.  Now  let  the  degree  of  Rlx)  be  m^,  and,  as  above,  the 
degrees  of/ and  <^,  n  and  m  respectively.  Then  (cf.  (2))  the  degrees 
of  Qq,  Q-^,  Q2,  '"  will  hen  —  m,  m  —  m^,  m^—m^,  ...  respectively.  Ac- 
cordingly, by  (5),  the  degrees  of  F  and  <I>  are  respectively  not  greater 

'^  {m  —  m^)  +  (wj  —  m'2)  4- h  (^p  -1  —  ^p)  =  m  —  nip, 

and  (n  —  m)-\-{m  —  m^)  +  (m^  —  m^)  +  •  •  •  4-  (Wp_ j  —  nip)  =  n  —  m^. 

Hence,  since  mp>0,  F  is  of  degree  less  than  m,  and  ^  of  degree  less 
than  n. 

*The  proof  we  have  given  of  this  theorem  applies  only  when  neither /nor  0  is  a 
constant.    The  truth  of  the  theorem  is  at  once  obvious  if  /  or  0  is  a  constant. 
o 
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Conversely,  we  will  now  show  that  if  F-^  is  a  polynomial  of  degree 
less  than  m,  and  <I>j  a  polynomial  of  degree  less  than  n^  and  if 

(6)  F^{xY{x)+^lx)<i>{x)  =  \, 

then  F^{x)  =  F{x),     ^-^{x)  =  ^(x). 

To  prove  this,  subtract  (6)  from  (4),  getting 

If  we  resolve  the  two  sides  of  this  identity  into  their  linear  fac- 
tors, we  see  that,  since  /  and  cf)  are  relatively  prime,  /  must  be  a 
factor  of  <E>j  —  ^  and  (f)  a  factor  of  F  —  F^.  This,  however,  is  pos- 
sible only  if  <l>j— 0  and  F—F^  vanish  identically,  as  otherwise 
they  would  be  of  lower  degree  than  /  and  <^  respectively.  We 
have  thus  proved  the  theorem  : 

Theorem  4.  Iff{x^  and  (f>{x)  are  relatively/  prime^  and  neither  is 
a  constant^  there  exists  one^  and  only  one,  pair  of  polynomials  F{x) 
<l>(:r),  whose  degrees  are  respectively  less  than  the  degrees  of  </>  and  f 
and  which  satisfy  the  identity  (4). 

Before  proceeding  to  the  general  applications  of  the  principles 
here  developed  which  will  be  found  in  the  next  section,  the  reader 
will  do  well  to  familiarize  himself  somewhat  with  the  ideas  involved 
by  considering  the  special  case  of  two  polynomials  of  the  second 
degree :  y^^^  ^a^x^-^-a^x  +  a^  «o  =^  ^' 

(l>(x)  =  hQX^-\-h^x  -\-b2  hf^^i  0. 

If  the  condition  that  these  two  polynomials  be  relatively  prime 
be  worked  out  by  a  direct  application  of  Euclid's  algorithm,  it 
will  be  found  necessary  to  consider  separately  the  cases  in  which 
^1^0  ~  ^0^1  ^^  ^^  ^^  ^^^  zero.  By  collating  these  results  it  will  be 
found  that  in  all  cases  the  desired  condition  is: 

{a^\  -  a^h^f  +  (a^b^  -  a^b^Xa^h^  -  a^b^)  ^  0. 

This  condition  may  be  found  more  neatly  and  quickly  by  obtaining 
the  condition  that  two  polynomials  of  the  form 

F{x)=p^x-\-p^, 

^{x)  =  qQX-hqi 

exist  which  satisfy  the  identity  (4). 

It  is  this  last  method  which  we  shall  apply  to  the  general  case  in 
the  next  section. 
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68.  The  Resultant  of  Two  Polynomials  in  One  Variable.     Let 

f{x)  =  a^x^  +  a^x^~'^  +    •  •  •    +  a^-^x  +  a^  a^^^O^  n>0^ 

^{x)  =  h^x^  +  h^x^-'^+    ..•    +h^_^x  +  h^  h^^O,m>0. 

The  condition  that  these  polynomials  be  relatively  prime  consists, 
as  we  see  from  Theorem  4,  §  67,  in  the  existence  of  constants  ^q, 
Pv  •••  Pm-v  %^  ^v  •••  9n-i  such  that 


(Pi 


X 


,TO— 1 


.TO— 2 


■\-   'P^'^     ^  + 


Equating  coefficients  of  like  powers  of  x^  we  see  that  the  following 
system  of  equations  is  equivalent  to  the  last  written  identity: 


(1) 


HV^-^%P\ 


=  0, 

=  0, 


^mPo  +  <^m-\P\  +    •  •  •   +  HPra-\  +  ^m%  +  ^m-l^l  H f"  ^O^m  =  0, 

^TO+ll?0  +  ^mPx  +    •  •  •   +  CL^Pm-\  +  ^m^l  +    +  ^0?  m+1  =0, 


^«Po  "^  ^n-lPl  "^ f"  ^n-m+lPm-1 

^nPl "I  I    ^n-m+2Ptn-\ 


+  ^TO9'«-m+l   +•••    +%„_i  =  0, 


^nPm~\ 


i^m9n-l —  ^» 


In  writing  these  equations  we  have  assumed  for  the  sake  of 
definiteness  that  n^m,  though  the  change  would  be  immaterial  if 
this  were  not  the  case.  This  is  a  system  oi  m-\-n  linear  equations 
in  the  m-{-n  unknowns^Q,  •  •  •  Pm-v  %•>  ' ' '  %-v  whose  determinant,  after 
an  interchange  of  rows  and  columns  and  a  shifting  of  the  rows,  is 


^0         ' 

0     ar 


a„  0 


a^  0  . 


.  0 
.  0 


.  0  a, 


0    h 


0  h 


h^    0 


0 


a  determinant  which,  it  should  be  noticed,  has  m+n  rows  and  columns. 
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If  72=?^  0,  the  set  of  equations  (1)  has  one  and  only  one  solution, 
and  /  and  <^  are  relatively  prime.  If  i2  =  0,  two  cases  seem  at  first 
sight  possible  (cf.  §  16):  either  the  system  of  equations  has  no  solu- 
tion, or  it  has  an  infinite  number  of  solutions.  This  latter  alterna- 
tive cannot,  however,  really  arise,  for  we  have  seen  in  Theorem  4, 
§  67,  that  not  more  than  one  pair  of  polynomials  F  and  ^  can  exist 
which  satisfy  formula  (4)  of  that  section  and  whose  degrees  do  not 
exceed  m  —  1  and  n  —  1  respectively.  Accordingly,  if  i^  =  0,  the 
set  of  equations  has  no  solution  and  /  and  <^  have  a  common  factor. 

li  is  called  the  resultant  of /and  </).^ 

The  term  resultant  has  thus  been  defined  only  on  the  supposition 
that  /  and  (^  are  both  of  at  least  the  first  degree.  It  is  desirable  to 
extend  this  definition  to  the  case  in  which  one  or  both  of  these 
polynomials  is  a  constant.  Except  in  the  extreme  case  m  =  ri  =  0, 
we  will  continue  to  use  the  determinant  R  as  the  definition  of  the 
resultant.     Thus  when  m  =  0,  92  >  0  we  have 

n      ...  n   \  n(n-\) 

W.A1  M"n    \  /  ^   \ 7^ 


\0q  J 

If  ^0^0  we  have  i^  ^  0,  and  moreover  in  this  case  /  and  <^  are 
relatively  prime    since   the  constant  </>  has  no  factors  other  than 
constants.     If,  however,  h^  —  0,  we  have  ^  =  0,  and  every  factor  of 
/  is  a  factor  of  <^,  since  <^  is  now  identically  zero. 
Similarly  when  n  =  0,  m  >  0,  we  have 

and  we  see  that  a  necessary  and  sufficient  condition  that  /  and  </>  be 
relatively  prime  is  that  7^:9^0. 

Finally,  when  92  =  m  =  0,  we  define  the  symbol  H  [  ^  j,  which  we 
still  use  to  denote  the  resultant,  by  the  formula  ^^ 

7?  (^^  _  I  ^         when  a^  and  h^  are  not  both  zero. 


=1; 


h^J       1 0         when  a^  =  5^  =  0. 

We  may  now  say  with  entire  generality: 

Theorem.  A  necessary  and  sufficient  condition  for  two  'polynomials 
in  one  variable  to  he  relatively  'prime  is  that  their  resultant  do  not  vanish. 

For  another  method  of  approach  to  the  resultant,  cf .  Exercise  4 
at  the  end  of  §  76. 

*  It  should  be  noticed  that  the  resultant  of  0  and  /  may  be  the  negative  of  the 
resultant  of  /  and  <t>.    This  change  of  sign  is  of  no  consequence  for  most  purposes. 
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69.  The  Greatest  Common  Divisor  in  Determinant  Form. 

Definition.  By  the  ith  subresultant  Mi  of  two  polynomials  in 
Qne  variable  is  understood  the  determinant  obtained  by  striking  out  the 
first  i  and  the  last  i  rows  and  also  the  first  i  and  the  last  i  columns  from 
the  resultant  of  these  polynomials. 

Thus  if  the  polynomials  are  of  degrees  5  and  3  respectively,  the 
resultant  72  is  a  determinant  of  the  eighth  order,  M^  of  the  sixth, 
II2  of  the  fourth,  and  M^  of  the  second,  as  indicated  below  : 

0  0 


B  = 


a. 


0 


0 


0 


ac 


a. 


a. 


a, 


E,= 


0 


0 


0 


^0 

0 
0 

7^2  = 

0 
0 

^0 

a^ 

«2 

H 

a^ 

«5 

h 
h 

0 
0 

0 

0 

^0 

«1 

H 

h 
h 
h 

0 

.^0 

^1 

h 

h 

h 

h 

0 

0 


a, 


h 


0 


0 


0 


0 
0 
0 
0 


We  now  state  the  following  results,  leaving  their  proof  to  the 
reader: 

Lemma.     Iff^{x)  and  (f>i{x)  are  polynomials^  and 

f{x)  =  {x-  cc)f^(x),  <t)(x)  =  (x-  a)(t>i(x), 

the  resultant  of  f^  and  c^^  and  their  successive  subresultants  are  equal 
respectively  to  the  successive  subresultants  off  and  (j). 

Theorem  1.  The  degree  of  the  greatest  common  divisor  of  f{x) 
and  (f>{x)  is  equal  to  the  subscript  of  the  first  of  the  subresultants 
Bq  =  B,  i^j,  B2^  •  •  •  which  does  not  vanish. 

Theorem  2.  If  i  is  the  degree  of  the  greatest  common  divisor  of 
two  polynomials  f{x)  and  </>(a;),  then  this  greatest  common  divisor  may 
be  obtained  from  the  ith  subresultant  of  f  and  (f>  by  replacing  the  last 
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element  in  the  last  roiv  of  coefficients  of  f  hy  f(x\  the  element  just  above 
this  hy  xfix)^  the  element  above  this  by  x^f{x\  etc.;  and  replacing 
the  last  element  in  the  first  row  of  coefficients  of  <f)  by  <t>{x)^  the  element 
below  this  by  x(f>{x)^  etc. 

If,  for  instance,  the  degrees  of  /and  </>  are  5  and  3  respectively, 
and  i  =  1,  the  greatest  common  divisor  is 


0 
0 

0 
0 
b. 


a^ 
a^ 
0 
0 


a^ 
0 

5o 


«3 
h 

b^ 


a, 


Ut 


0 


xf(x) 
(t>{x) 

X(f)(x) 
X^(j){x) 

x^(j){x) 


70.   Common   Roots   of   Equations.      EHmination.      Consider   the 
equations  f(x)  =  a^x^  +  a^x''-'^  + -" +a^=0  a^^O, 


(t){x)  =  b^x'^  +  b^x^-'^  +  . . .  +  5^  =  0 


^=^0, 


whose  roots  are  a^,  Wg,  •••«„  and  /3j,  /Sg,  •••/8^,  respectively;  and 
suppose /(a;)  and  <i>{x)  resolved  into  their  linear  factors: 

f{x)  =  a^{x-  a^){x -a^)--'{x-  «„), 

^{x)  =  b^x  -  I3^){x  -^^)'--{x-  13^). 

Since,  by  Theorem  1,  §  65,  these  sets  of  factors  are  unique,  it  is 
evident  that  the  equations /(:r)=  0  and  (f>{x)=  0  will  have  a  common 
root  when,  and  only  when,  f(x)  and  (t)(x)  have  a  common  factor, 
that  is,  when,  and  only  when,  the  resultant  H  of/ and  <^  is  zero. 

To  eliminate  x  between  two  equations  f{x)  =  0  and  (j)(x)  =  0,  is 
often  taken  in  elementary  algebra  to  mean :  to  find  a  relation  between 
the  coefficients  of/ and  (f>  which  must  hold  if  the  two  equations  are 
both  satisfied;  that  is,  to  find  a  necessary  condition  for  the  two 
equations  to  have  a  common  root.  For  most  purposes,  however, 
when  we  eliminate  we  want  a  relation  between  the  coefficients  which 
not  only  holds  when  the  two  equations  have  a  common  root,  but 
such  that,  conversely,  when  it  holds  the  equations  will  have  a 
common  root.  From  this  broader  point  of  view,  to  eliminate  x 
between  two  equations  f{x)  =  0  and  (t>{x)  =  0  means  simply  to  find 
a  necessary  and  sufficient  condition  that  these  equations  have  a 
common  root.  Hence  the  result  of  this  elimination  is  i?  =  0.  Let 
us,  however,  look  at  this  question  from  a  little  different  point  of  view. 


FACTORS  OF  POLYNOMIALS  IN  ONE  VARIABLE 


199 


In  the  equations 


(1)  a^Q^  +  a-^x'^ -\- a^x^ -\- a^x^  +  a^x -\- a^  =  0  a^^O, 

(2)  b^x^ -\-  b^x^ -{-b^x+b^  =  0  b^^ 0, 

let  us  consider  the  different  powers  of  x  as  so  many  distinct  un- 
knowns. We  have,  then,  two  non-homogeneous,  linear  equations  in 
the  five  unknowns  a;,  x^^  o^^  x^,  x^.  Multiplying  (1)  through  by  x 
and  then  by  a^,  and  (2)  by  x^  o?^  y?^  a;*,  in  turn,  we  have 

a^x^  +  a^x^  +  a^  +  a^x'^  -\-  a^x^  +  a^x^  =  0, 

a^x^  +  a^x^  +  a^x"^  +  a^x^  +  a^x^  +  a^x  =  0, 

a^a^  -\-  a^x^  +  a^x^  +  a^x^  +  a^x  -\-  a^  =0, 

b^a^  +  b-^x^  +  ^ga:  +  Sg  =0, 

5^2:*  -\-  b-^x^  +  b^x^  -f  ^ga:  =  0, 

K^  +  ^1^^  +  h^  +  ^3^  =  ^^ 

b^x^  +  ^^a:^  +  ^2^^  +  ^3^  =  ^» 

^0^^  +  b^a^  +  ^ga;^  4-  Ir^x^  =  0, 

a  system  of  eight  non- homogeneous,  linear  equations  in  seven 
unknowns. 

If  a  value  of  x  satisfies  both  (1)  and  (2),  it  will  evidently  satisfy 
all  the  above  equations.  These  equations  are  therefore  consistent, 
so  that  by  Theorem  1,  §  16,  we  have. 


% 

«i 

«2 

«3 

«4 

«5 

0 

0 

0 

«o 

^1 

«2 

ag 

a^ 

«5 

0 

0 

0 

% 

a^ 

«2 

«3 

«4 

a 

0 

0 

0 

0 

^0 

h 

h 

b 

0 

0 

0 

^0 

^1 

h 

h 

0 

0 

0 

^0 

^1 

h 

h 

0 

0 

0 

^0 

h 

h 

h 

0 

0 

0 

b. 

b. 

b. 

b. 

0 

0 

0 

0 

=  0. 


Hence  the  vanishing  of  this  determinant  is  a  necessary  condition  for 
(1)  and  (2)  to  have  a  common  root. 

This  device  is  known  as  Silvester's  Bialytic  Method  of  Elimina- 
tion.^ 


*  For  the  sake  of  simplicity  we  have  taken  the  special  case  where  ?i  =  5  and  m  =  3. 
The  method,  however,  is  perfectly  general. 
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The  above  determinant  is  seen  to  be  exactly  the  same  as  the 
resultant  R  of  (1)  and  (2),  so  tliat  Sylvester's  method  leads  to  the 
same  condition  for  two  equations  to  have  a  common  root  as  that 
found  above,  namely  11=  0,  It  does  not  prove,  however,  that  this 
condition  is  sufficient,  but  merely  that  it  is  necessary.  Thus  Syl- 
vester's method,  while  brief,  is  very  imperfect. 

The  number  of  roots  common  to  two  equations,  f{x)  =  0  and 
(f)(x)=  0,  and  also  an  equation  for  computing  the  common  roots, 
may  be  obtained  at  once  from  §  69. 

71.  The  Cases  Oq  =  0  and  b^  =  0.  It  is  important  for  us  to  note 
that  according  to  the  definitions  we  have  given,  the  determinant 
j5  J  ^0'  • "    n\  ^jii  ]jQ  ^Y^Q  resultant  of  the  two  polynomials 

(i>{x)  =  h^x^-]r  l\x^-^  +"--\-h^, 

only  when  /  and  (f)  are  precisely  of  degrees  n  and  m  respectively, 
that  is,  only  when  ^q=5^  0,  b^^O.  Thus,  for  instance,  the  resultant  of 
the  polynomials      f^^^  ^  ^^^n-i  +  ^^^n-2  +  . . .  +  ^^ 

(l>{x)  =  b^x'""  +  b^x'^-'^  H ^b, 

is  not   the  (m  +  7i)-ro wed  determinant  i^  ( , '    ^'  *      ^  )  but,  if  a^  ^  0, 

b^^  0,  the  (m  +  n  —  l)-rowed  determinant  E  (  i'  '  ^""j  or,  if  a^  or  b^ 
is  zero,  a  determinant  of  still  lower  order. ^      \  o'  '"   m/ 

Let  us  indicate  by  R  the  (m  +  7i)-rowed  determinant  ^  ( ,  ^'      j^ 

and  by  r  the  resultant  of  /  and  </>,  and  consider  the  case  a^  =  0, 
a^^O,  b^^O.  Since  every  element  of  the  first  column  of  E  except 
the  last  is  zero,  we  may  write 

E={-lY^''-^b^r. 
In  a  similar  way  we  see  that  if  the  degree  of  /  is  n  —  i,  and  5q  =^  0,  we 
may  write  E=±blr, 

and  if  the  degree  of  </>  is  m  —  %  and  a^  ^  0,  we  have 

E  =  a^r. 
Accordingly,  except  when  a^  =  ^^  =  0,  E  differs  from  r  only  by  a 
non-vanishing  factor. 

*  As  an  illustration  take  the  two  polynomials  f(x)  =  (a  +  jS)  x^  +  x  -  /3  and 
(f>(x)=ax-\-l.  If  a-\-^=^0  and  a  ^  0,  the  resultant  here  is  (a'^-l)^.  But  if 
o  =  —  /3  T^  0,  the  resultant  is  1  —  a^. 
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Theorem.     Although  R[  J^'  "    "']  is  the  resultant  off  and  </>  onli/ 

when  aQ=^0  and  h^^O  (or  when  m  =  0  or  n=  0)^  nevertheless  its 
vanishing  still  forms  a  necessary  and  sufficient  condition  that  f  and  (j> 
have  a  common  factor  even  when  a^  =  0  or  h^=  0,  provided  merely  that 
both  a^  and  b^  are  not  zero. 

That  this  last  restriction  can  not  be  removed  is  at  once  evident; 
for,  if  aQ  =  bQ  =  0,  every  element  in  the  first  column  of  the  determinant 
is  zero,  and  hence  the  determinant  vanishes  irrespective  of  whether 
/and  (j)  have  a  common  factor  or  not.*  All  that  we  can  say,  if  we 
do  not  wish  to  make  this  exception,  is,  therefore,  that  in  all  cases  the 
vanishing  of  M  forms  a  necessary  condition  that  /  and  (f)  have  a 
common  factor. 

72.  The  Resultant  of  Two  Binary  Forms.  Let  us  now  consider 
the  binary  forms 

f{x-^,  x^  =  a(^x\-\-  a-^x\~'^X2+  •••  -\- a^x^  (^^1) 

(i>{x^.  x^)=b^x^  +  b^:x^-%+  ...  +5^:r^  (m^l). 

By  the  side  of  these  forms  we  write  the  polynomials  in  one 
variable  ^(^^  =  ^^  a^^  +  «i  x«-i  +•••+«, 

^{x)  =  bQx'^  +  h^x'"'-^  +  '•'  +b, 


Un,   •••  bU 


The  determinant  R 

will  be  the  resultant  of  F  and  <I>  only  when  neither  a^  nor  b^  is  zero. 
We  will,  however,  call  it  the  resultant  of  the  binary  forms  /  and  </> 
in  all  cases. 

*  By  looking  at  the  question  from  the  side  of  the  theory  of  common  roots  of  two 
equations  (cl  §  70),  and  by  introducing  the  conception  of  infinite  roots,  we  may  avoid 
even  this  last  exception.     An  equation 

UqX'^  +  aiX'^-'^ -\- \-ctn=0 

has  n  roots,  distinct  or  coincident,  provided  ao=/^0.  If  we  allow  Gq  to  approach  the  value 
zero,  one  or  more  of  these  roots  becomes  in  absolute  value  larger  and  larger,  as  is  seen 
by  the  transformation  x'  =  1/x.  Hence  it  is  natural  to  say  that  if  (Zq  =  0  the  equation 
has  an  infinite  root.  If  then  we  consider  two  equations  each  of  which  has  an  infinite 
root  as  having  a  common  root,  we  may  say : 

A  necessary  and  sufficient  condition  that  the  equations 

a^x"^  +  aix''-'^ -\ |-«n  =  0  w>0, 

have  a  common  root  is  in  all  cases  the  vanishing  of  i?  ( .  ^'      ,  "  )  * 
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Theorem.  A  necessary  and  sufficient  condition  for  two  binary 
forms  to  have  a  common  factor  other  than  a  constant  is  that  their 
resultant  he  zero. 

If  a^  and  h^  are  both  different  from  zero,  the  non-homogeneous 
polynomials  F  and  ^  correspond  to  the  forms  /  and  </>  according  to 
the  definition  of  §  62.  Accordingl}'',  by  Theorem  2  of  that  section, 
a  necessary  and  sufficient  condition  that  /  and  <^  have  a  common 
factor  other  than  a  constant  is,  in  this  case,  the  vanishing  of  their 
resultant. 

On  the  other  hand,  if  a^  =  5q  =  0,  /  and  <^  have  the  common  factor 
a^g,  and  the  resultant  of /and  <^  obviously  vanishes. 

A  similar  remark  applies  to  the  case  in  which  all  the  a's  or  all 
the  5's  are  zero. 

There  remain  then  only  the  following  two  cases  to  be  considered, 

(1)  a^r^O;   b^  =  h^=  ...  =hf,=  0,h^+i^O  {k<m), 

(2)  h^^O;   aQ  =  a^=  ••.  =a^  =  0,  ^^+i^0  {k<n). 

In  Case  (1),  ^corresponds  to  f  and,  if  we  write 

^{^v  ^2)  =  ^■^ViC^r  ^2)' 

<I>  corresponds  to  </)j.  Now  we  know  in  this  case  (cf.  §  71)  that 
JR^O  is  a  necessary  and  sufficient  condition  that  F  and  <l>  be  rela- 
tively prime.  Accordingly,  by  Theorem  2,  §  62,  it  is  also  a  necessary 
and  sufficient  condition  that /and  (j>^  be  relatively  prime.  But  since 
X2  is  not  a  factor  of/,  the  two  forms  /  and  (j)  will  be  relatively  prime 
when  and  only  when/  and  (f)^  are  relatively  prime.  Thus  our  theo- 
rem is  proved  in  this  case. 

The  proof  in  Case  (2)  is  precisely  similar  to  that  just  given. 


CHAPTER   XVI 
FACTORS   OF   POLYNOMIALS   IN   TWO   OR  MORE   VARLABLES 

73.  Factors  Involving  only  One  Variable  of  Polynomials  in  Two 
Variables.  We  have  seen  in  the  last  chapter  that  polynomials  in 
one  variable  are  always  reducible  when  they  are  of  degree  higher 
than  the  first.  Polynomials  in  two,  or  more,  variables  are,  in  gen- 
eral, not  reducible,  as  we  have  already  noticed  in  the  special  case  of 
quadratic  forms. 

Let  f{x,  y)  be  any  polynomial  in  two  variables,  and  suppose  it 
arranged  according  to  powers-of  x^ 

f{x,  y)=a^{y)x''  +  a^{y)x''-^  ^  ...  +a^-i{y)x  + aj,y\ 

the  a's  being  polynomials  in  y. 

Theorem  1.  A  necessary  and  sufficient  condition  that  a  poly- 
nomial in  y  alone^  'i^il/)')  ^^  a  factor  of  f{x,  y)  is  that  it  be  a  factor  of 
all  the  a's. 

The  condition  is  clearly  sufficient.  To  prove  that  it  is  necessary, 
let  us  suppose  that  '^{y)  is  a  factor  oif{x,  y).     Then 

(1)  a,{y)x^+-.  -{-  a,{y)  =  ^^r{y)[h^{y)x-  +  ...  +5,(j/)], 

where  the  5's  are  polynomials  in  y.  For  any  particular  value  of  y 
we  deduce  from  (1),  which  is  then  an  identity  in  a;,  the  following 
equations  : 


<^n{y)  =  ^{y)K{yy 


Since  these  equations  hold  for  every  value  of  y,  they  are  identities, 
and  -^{y^  is  a  factor  of  all  the  a's. 
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Theorem  2.  If  f(x^y)  and  cfy^x,  y)  are  any  two  polynomials  in  x 
and  y,  and  "^{y)  is  an  irreducible  polynomial  in  y  alone  *  which  is  a 
factor  of  the  product  f(f>,  then  -sjr  is  a  factor  off  or  of  (f). 

Let  f{x,  y)  =  aQ(y)x''  +  a^(y)x''-^  -{■  •••  +«„(?/), 

and  (t>(x,y)  =  b,{y)x-^  +  b^{y)x^-^-^  ...  -\-bJy); 

then 

f(x,  y)<i>{x,  y)  =  a,b,x-^-^  +  {a,b^  +  a^b,)x-^-^-^ 

-\-{a^b^-\-  a^b^  +  a^b^)x''^'^-'^  +  ...  +  aj)^. 

In  order  to  prove  that  -v/r  is  a  factor  either  of  /  or  of  <^  we  must 
prove  that  it  is  either  a  factor  of  all  the  a's  or  of  all  the  5's.  If  this 
were  not  the  case,  we  could  find  a  first  a  in  the  sequence  a^^  a^^  •••  a^ 
of  which  y\r  is  not  a  factor.  Call  this  function  a^.  There  would  also 
be  a  first  b  in  the  sequence  of  5^,  5^,  .••  5^  which  is  not  divisible  by  -^/r. 
Call  this  function  bj.  Our  theorem  will  be  proved  if  we  can  show 
that  this  assumption,  that  a^  and  bj  are  not  divisible  by  -y/r  while  all 
the  functions  a^,  •-.  ai_^^  5o,  •••  ^y-j,  are  divisible  by  ^/r,  leads  to  a 
contradiction.  For  this  purpose  let  us  consider  in  the  product  f(f> 
the  coefficient  of  a;(«-»)+(»^-J),  which  may  be  written 

«o  h+3  +  •  •  •  +  cii-i  ^j+i  +  a^  bj  +  a^^^  ^y-i  +  •  •  •  +  «i+y  ^o' 

provided  we  agree  that  the  a's  and  J's  with  subscripts  greater  than 
n  and  m  respectively  shall  be  identically  zero.  Since  /</>  is  by 
hypothesis  divisible  by  -i/r,  it  follows  from  Theorem  1  that  the  last 
written  expression  must  be  divisible  by  -v/r.  This  being  obviously 
the  case  for  all  the  terms  which  preceed  and  for  all  which  succeed 
the  term  af>j^  it  follows  that  this  term  must  also  be  divisible  by  ^^r^ 
so  that  among  the  linear  factors  of  the  function  afij  must  be  found 
n/r.  But  by  Theorem  1,  §  Qb^  the  function  afij  can  be  resolved  into 
its  linear  factors  in  essentially  only  one  way,  and  one  way  of  so  re- 
solving it  is  to  resolve  a^  and  bj  into  their  linear  factors.  Since  y^  is 
not  one  of  these  factors,  we  are  led  to  a  contradiction,  and  our 
theorem  is  proved. 

An  important  corollary  of  our  theorem  is : 

Corollary.  Letf{x^  ?/)  and  (j)  (rr,  y)  be  polynomials  in  (x^y\  and 
let  '^(y)  be  a  polynomial  in  y  alone.  If  yjr  is  a  factor  of  the  product  of 
f(j>  but  is  relatively  prime  to  </>,  then  yjr  is  a  factor  off. 

*  That  is,  a  linear  polynomial. 
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If  yjr  is  irreducible,  this  corollary  is  identical  with  the  theorem. 
Let  us  suppose  yjr  resolved  into  its  irreducible  factors  none  of  which 
are  constants,  that  is,  into  its  factors  of  the  first  degree : 

f  (y )  =  fiil/)  ^2(y)  •  •  •  fM' 

Now  consider  the  identity  which  expresses  the  fact  that  -v/r  is  a  factor 

of/c^: 

(2)  f{x,  y)(\>{x,  y)  =  ylr^(y)ylr^(y)  •  •  •  irly)a{x,  y). 

This  shows  that  '^i{y)  is  a  factor  of  fcj)  and  hence,  by  Theorem  2, 
it  is  a  factor  either  of  /  or  of  (f>.  Since  yjr  and  (f)  are  relatively 
prime,  yjr^  cannot  be  a  factor  of  <j).     It  must  then  be  a  factor  of  / : 

f(x,y)  =  ^|r^{y)f^{x,y). 

Substituting  this  in  (2),  and  cancelling  out  yjr^^  as  we  have  a  right  to 
do  since  it  is  not  identically  zero,  we  get 

From  this  we  infer  that  y^^^  being  a  factor  oi  f^cp,  must  be  a  factor 

^  fi{^^  y)=i'ly)fl^^  y)' 

We  substitute  this  in  (3)  and  cancel  out  '^^.  Proceeding  in  this  way 
^®  ^^  f{x,  y)  =  '^ly)y\r^{y)  •  •  •  irly)flx,  y)  ~  ir{y)flx,  y), 

an  identity  which  proves  our  corollary. 

EXERCISE 

If  f{x,  y)  and  <f>(x,  y)  are  polynomials,  then  any  two  sets  of  polynomials 

-^2(2/)'  Q2(^'  y)y  -^2(^'  y)y 
wiU  be  proportional  to  each  other  provided, 
(a)  they  satisfy  the  identities 

Pi{y)f{^^  y)  =  Qi(^,  y)H^,  y)+R^{x,  y), 
^2(y)/(^'  y)  =  Q2(^r  y)<f>(^,  y)-^R<i{x,  ?/); 

(li)  there  is  no  factor  other  than  a  constant  common  to  Pi,  Qi,  and  also  no 
factor  other  than  a  constant  common  to  P^,  Q^ ; 

(c)  i?j  and  i?2  ^r®  both  of  lower  degree  in  x  than  <^. 
(Cf.  Theorem  2,  §  63.) 
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74.  The  Algorithm  of  the  Greatest  Common  Divisor  for  Polyno- 
mials in  Two  Variables.     We  will  consider  the  two  polynomials  in 

""  ^''^  ^'  f{x,  y)  ^  aly)x^  +  aly)x^-^  +  .  • .  +  a^y),       . 

^{x,  y)=hly)x^  +  ^i(^K-^  +  -  +  hj^y), 

and  assume  «q  ^  0,  6^ ^  0,  n^m  >0. 

Theorem  1  of  the  last  section  in  combination  with  the  results  of 
§  67  enables  us  to  get  all  the  common  factors  of  /  and  <^  which  in- 
volve y  only;  for  such  factors  must  be  common  factors  of  all  the  a's 
and  all  the  6's. 

It  remains,  then,  merely  to  devise  a  method  of  obtaining  the  com- 
mon factors  of/  and  </>  which  do  not  themselves  contain  factors  in  y 
alone.  We  will  show  how  this  can  be  done  by  means  of  the  algo- 
rithm of  the  greatest  common  divisor. 

Dividing  /  by  <^  (cf .  §  63,  Theorem  2),  we  get  the  identity 

^o{y)A^^  y)=  Qo(^^  ^)^(^'  y)  +  ^i(^'  ^> 

when  M^  is  either  identically  zero,  or  is  of  lower  degree  in  x  than  <j). 
If  H^  ^  0,  divide  <t>  by  iZ^,  getting  the  identity 

P\{y)4>{^^  y)  =  Qii.^^  y)^i(^'  y)  +  ^2(^'  y)^ 

where  M^  is  either  identically  zero,  or  is  of  lower  degree  in  x  than 
M^,  If  7^2  ^^^  divide  M^  by  Hy  Proceeding  in  this  way,  we  get  the 
following  system  of  identities  in  which  the  degrees  in  x  of  M^,  R^,  ••• 
continually  decrease,  so  that  after  a  certain  number  of  steps  we  reach 
an  i2,  say  i^p+j,  which  is  independent  of  x  : 

^oiy)f(^^  y)  -  ^o(^'  y)4'(^^  y)  +  -^i(^^  y)^ 

^i(y)^{^^  y)  =  Q\{^^  y)^\{^^  y)+J^2(x^  y\ 

^2(^)^i(^'  y)  =  ^2(^'  ^)^2(^'  y)  +  ^3(^'  y)^ 


(1) 


Pp-i{y)Bp-2(^.  y)  =  Qp-i{x,  y)R,_^{x,  y)+  RJ^x,  y\ 

^p(^)-^p-i(^'  y)  =  Ql^^  y)^l^^  y)  +  ^p+i(^)- 


Theorem  1.     A  necessary  and  sufficient  condition  thatf  and  <^  have 
a  common  factor  which  involves  x  is 
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In  order  to  prove  this  theorem  we  first  note  that,  by  the  first  of  the 
identities  (1),  any  common  factor  of/ and  </>  is  a  factor  of  i^^  hence, 
by  the  second  of  the  identities,  it  is  a  factor  of  R^^  etc.  Finally  we 
see  that  every  common  factor  of  f  and  cf)  is  a  factor  of  all  the  It's.  But 
i?p+-^  does  not  contain  a;.  Hence  if /and  <^  have  a  common  factor 
which  contains  ic,  Rp+i{y)  =  0. 

Now  suppose  conversely  that  llp+-^(y)  =  0,  and  let 

(2)  Blx,y)  =  S{y)G{x,y), 

where  Cr  has  no  factor  in  y  alone.*  The  last  identity  (1)  then  tells 
us  that  Pp{y)  is  a  factor  of 

Qp{x,  y)S{y)a{x,  y), 

and  since  by  hypothesis  G-  has  no  factor  in  y  alone,  Pp{y)  must,  by 
the  Corollary  of  Theorem  2,  §  73,  be  a  factor  of  QpS,  that  is 

(3)  Q,(x,y)S{y)  =  Ply)H{x,y). 

Substituting  first  (2)  and  then  (3)  in  the  last  identity  (1),  and  cancel- 
ling out  the  factor  Pp{y)  from  the  resulting  identity,  as  we  have  a 
right  to  do  since  Pp{y)  ^  0,  we  get  the  result 

Ilp--^{x,  y)  =  H{x,  y)G{x,  y). 

That  is,  (r  is  a  factor  not  only  of  Mp  but  also  of  Pp^^.  Accordingly 
we  may  write  the  next  to  the  last  identity  (1)  in  the  form 

Pp-i{y)Pp-2{^.  y^  =  J{x,  y)G(x,  y). 

By  the  corollary  of  Theorem  2,  §  73,  we  see  that  Pp-i{y)  is  a  factor 
of  e7,  so  that  Pp_^{y)  can  be  cancelled  out  of  this  last  written  identity, 
and  we  see  that  (7  is  a  factor  of  Rp-2' 

Proceeding  in  this  way,  we  see  that  (r  is  a  factor  of  all  the  ^'s^ 
and  therefore,  finally,  of  /  and  <j>.  Moreover,  we  see  from  (2)  that 
G  is  of  at  least  the  first  degree  in  x^  as  otherwise  Rp  would  not  con- 
tain x^  while  Rp+i  was  assumed  to  be  the  first  of  the  ^'s  which  did 
not  involve  x. 

Thus  our  theorem  is  proved. 

Since,  as  we  saw  above,  every  common  factor  of  /  and  <^  is  also  a 
factor  of  all  the  jR's,  it  follows  from  (2)  that,  if  t/t  is  a  common  factor 
of/  and  <^,  (3t(^^  ^)^(^)  =  ^(^^  y-) ^(^^  y-)_ 

*  If  Bp  has  no  factor  in  y  alone,  8  reduces  to  a  constant. 
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If  then  yjr  contains  no  factor  in  y  alone,  S  must,  by  the  Corollary  of 
Theorem  2,  §  73,  be  a  factor  of  jfiT.  Consequently  by  cancelling  out 
S  from  the  last  written  identity,  we  see  that  -v/r  is  a  factor  of  G-. 
That  is, 

Theorem  2.  If  in  Euclid^ s  algorltJim  Rp+i  =  0,  the  greatest  com- 
mon divisor  of  f  and  0  which  contains  no  factor  in  y  alone  is  the  'poly- 
nomial (jr{x^  y)  obtained  by  striking  from  Rp{x^  y^  all  factors  in  y 
alone. 

We  note  that  if  Rp+^  is  a  constant  different  from  zero,  /  and  ^ 
are  relatively  prime ;  but  that  the  converse  of  this  is  not  true  as  the 
simple  example 

f  =  2x''  +  ^y^  <t>  =  x 

shows. 

Going  back  to  the  identities  (1),  we  get  from  the  first  of  these 
identities,  by  mere  transposition,  the  value  of  M^  in  terms  of  /  and 
(f)  (and  Pq,  ^q).  Substituting  this  value  in  the  second  identity,  we 
get  a  value  for  M^  in  terms  of  /,  (/>,  and  certain  P's  and  Q's.  Pro- 
ceeding in  this  way,  we  finally  get  the  formula 

(4)  -^P+ily)  =  ^(^.  ^)/(^,  ^)  +  ^(^,  ^)^(^,  I/) 

where  F  and  ^  are  polynomials  in  (a;,  y). 

75.   Factors  of  Polynomials  in  Two  Variables. 

Theorem  1.  If  f{x^  y)  and  </)(a;,  y)  are  any  two  polynomials  in  x 
and  y,  and  yfr^x,  y)  is  an  irreducible  polynomial  which  is  a  factor  of 
the  product  f(j>^  then  yjr  is  a  factor  of  f  or  of  cj). 

If  yjr  does  not  contain  both  x  and  y,  this  theorem  reduces  to 
Theorem  2,  §  73.  It  remains,  then,  only  to  consider  the  case  that  yjr 
involves  both  variables.  In  this  case,  at  least  one  of  the  polynomials 
/,  (/>  must  be  of  at  least  the  first  degree  in  x.  Without  loss  of  gener- 
ality we  may  assume  this  to  be/.  If  -v/r  is  a  factor  of/,  our  theorem 
is  true.  Suppose -v/r  is  not  a  factor  of /;  then,  since -v^  is  irreducible,  / 
and  yjr  are  relatively  prime,  and  if  we  apply  the  algorithm  of  the 
greatest  common  divisor  to  /  and  yjr  (as  we  did  in  the  last  section  to 
/  and  <^)  the  first  remainder  Hp+i  (y)  which  does  not  involve  x  is  not 
identically  zero.     The  identity  (4)  of  the  last  section  now  becomes 

(1)  Bp^,(y)  =  Fix,  y)f{x,  y)  +  ^(:r,  y)^(a;,  y). 
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If  we  multiply  this  by  <^(a;,  y),  the  second  member  becomes  a  poly- 
nomial which  has  i/r  as  a  factor,  since,  by  hypothesis,  /<^  has  i/r  as  a 
factor.     We  may  therefore  write 

Now  no  factor  other  than  a  constant  of  Rp+i  can  be  a  factor  of  i/r, 
since  -^/r  is  irreducible.  Consequently,  by  the  Corollary  of  Theorem 
2,  §  73,  Rp+i  is  a  factor  of  ;^(a;,  «/).  Cancelling  out  Rp+i  from  (2), 
as  we  have  a  right  to  do  since  it  does  not  vanish  identically,  we  get 
an  identity  of  the  form 

^{x,  y)  =  ^\r{x,  y)xx{x,  y); 

that  is,  i/r  is  a  factor  of  <^,  and  our  theorem  is  proved. 

By  applying  this  theorem  a  number  of  times,  we  get  the 

Corollary.  If  the  product  of  any  number  of  polynomials  in  two 
variables,  f^x,  y)Mx,  y)  -  fix,  y), 

is  divisible  by  an  irreducible  polynomial  -yfr^x,  y),  then  yjr  is  a  factor  of 
at  least  one  ofthef's. 

We  come  now  to  the  fundamental  theorem  of  the  whole  subject 
of  divisibility  of  polynomials  in  two  variables. 

Theorem  2.  A  polynomial  in  two  variables  which  is  not  identically 
zero  can  be  resolved  into  the  product  of  irreducible  factors  no  one  of 
which  is  a  constant  in  one,  and  essentially  in  only  one,  way. 

That  a  polynomial  f{x,  y)  can  be  resolved  into  the  product  of 
irreducible  factors  no  one  of  which  is  a  constant  in  at  least  one  way 
may  be  seen  as  follows.  If/ is  irreducible,  no  factoring  is  possible 
or  necessary.     If/ is  reducible,  we  have 

f{x,  y)  -fix,  y)flx,  y), 

where  neither /^  nor/2  is  a  constant.  If  /^  and  f^  are  both  irredu- 
cible, we  have  a  resolution  of/  of  the  form  demanded.  If  not,  resolve 
such  of  these  polynomials /^  and/2  as  are  reducible  into  the  product 
of  two  factors  neither  of  which  is  a  constant.  We  thus  get  /  ex- 
pressed as  the  product  of  three  or  four  factors.  This  is  the  resolu- 
tion of /demanded  if  all  the  factors  are  irreducible.  If  not,  resolve 
such  as  are  reducible  into  the  product  of  two  factors,  etc.  This  pro- 
cess must  stop  after  a  finite  number  of  steps,  for  each  time  we  factor 
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a  polynomial  into  two  factors,  the  degrees  of  the  factors  are  lower 
than  the  degree  of  the  original  polynomial.  We  shall  thus  ulti- 
mately resolve  /  by  this  process  into  the  product  of  irreducible 
factors,  no  one  of  which  is  a  constant. 

Suppose  now  that  /  can  be  resolved  in  two  ways  into  the  product 
of  irreducible  factors  none  of  which  are  constants, 

/(^.  y)=fi{^^  y)f'2,{^'>  y)  -"fki^^  y) 

Since  <^i  is  a  factor  of  /,  it  must,  by  the  Corollary  of  Theorem  1, 
be  a  factor  of  one  of  the  polynomials /i,/2,  -"  fk-  Suppose  the/'s  so 
arranged  that  it  is  a  factor  of /j.  Then,  since /j  is  irreducible, /^  and 
<^j  can  differ  only  by  a  constant  factor,  and  since  c^^^O,  we  may  cancel 
it  from  the  identity  above,  getting 

^1/2/3  •'•fk  =  4>2^3  '"  ^i- 
In  the  same  way  we  see  from  this  identity  that/2  and  one  of  the 
0's,  say  (^2'  differ  only  by  a  constant  factor.     Cancelling  (j)^^  we  get 

^1^2/3  "'fjc  =  4*z  '"  ^l' 

Proceeding  in  this  way,  we  should  use  up  the  <^'s  before  the  /'s  if 
l<k^  the/'s  before  the  0's  if  Z  >  ^.  Neither  of  these  cases  is  possible, 
for  we  should  then  have  ultimately  a  constant  on  one  side  of  the 
identity,  and  a  polynomial  different  from  a  constant  on  the  other. 
Thus  we  must  have  k  =  I.  Moreover  we  see  that  the  /'s  can  be 
arranged  in  such  an  order  that  each  /  is  proportional  to  the  corre- 
sponding <^,  and  this  is  what  we  mean  (cf.  Definition  7,  §  60)  by 
saying  that  the  two  methods  of  factoring  are  not  essentially  different. 

Thus  our  theorem  is  proved. 

Theorem  3.  J^  two  polynomials  f  and  0  in  (x^  y)  are  relatively 
prime,  there  are  only  a  finite  number  of  pairs  of  values  of  (re,  y)  for 
which  f  and  cf)  both  vanish.* 

For  if/ and  (p  both  vanished  at  the  points 

and  if  these  points  were  infinite  in  number,  there  would  be  among 
them  either  an  infinite  number  of  distinct  x^s  or  an  infinite  number  of 

*  Stated  geometrically,  this  theorem  tells  us  that  two  algebraic  plane  curves 
y*(x,  y)  =  0,  0(x,  y)  =  0  can  intersect  in  an  infinite  number  of  points  only  when  they 
have  an  entire  algebraic  curve  in  common. 
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distinct  ?/'s.  By  a  suitable  choice  of  notation  we  may  suppose  that 
there  are  an  infinite  number  of  distinct  y's.  Then  it  is  clear  that  / 
and  <f)  must  be  of  at  least  the  first  degree  in  x^  since  a  polynomial  in  ^ 
alone  which  does  not  vanish  identically  cannot  vanish  for  an  infinite 
number  of  values  of  y.  We  may  then  apply  to /and  cj)  the  algorithm 
of  the  greatest  common  divisor  as  in  §  74,  thus  getting  (cf.  (4), 
§  74)  an  identity  of  the  form 

(4)  F{x,  y)f{x,  y)  +  ^(x,  y)<^(x,  y)  =  R,^^{y)  ^  0. 

Since  the  first  member  of  (4)  vanishes  at  all  the  points  (3),  Mp+-^{y) 
would  vanish  for  an  infinite  number  of  distinct  values  of  y,  and  this 
is  impossible. 

An  important  corollary  of  the  theorem  just  proved  is  that  if  /  and 
(j>  are  two  irreducible  polynomials  in  (x,  y),  and  if  the  equations 
/=  0  and  0  =  0  have  the  same  locus,  then/  and  </>  differ  merely  by 
a  constant  factor.  This  would,  however,  no  longer  be  necessarily 
true  if /and  <f>  were  not  irreducible,  as  the  example, 

f=xy,    ^  =  x^y, 

shows;  for  the  two  curves /  =  0  and  (j)  =  0  are  here  identical,  since 
the  curve  in  each  case  consists  of  the  two  coordinate  axes,  and  yet/ 
and  <j>  are  not  proportional.     By  means  of  the  following  convention, 
however,  the  statement  made  above  becomes  true  in  all  cases : 
Let/ be  resolved  into  its  irreducible  factors, 

where  /,  •  •  •  fjg  are  irreducible  polynomials  in  (x^  «/),  no  two  of  which 
are  proportional  to  each  other.  The  curve /=  0  then  consists  of  the 
*  pieces,  y^  =  0,/2=0,  •-/,  =  0. 

To  each  of  these  pieces  we  attach  the  corresponding  positive  integer 
f^i  which  we  call  the  multiplicity  of  this  piece ;  and  we  then  regard 
two  curves  given  by  algebraic  equations  as  identical  only  when  they 
consist  of  the  same  irreducible  pieces,  and  each  of  these  pieces  has 
the  same  multiplicity  in  both  cases.  With  this  convention  we  may 
say: 

Corollary.  Iff  and  <f)  are  polynomials  in  (rr,  y)  neither  of  which 
is  identically  zero,  a  necessary  and  sufficient  condition  that  the  two  curves 
/=  0,  (j)  =  0  be  identical  is  that  the  polynomials  f  and  (f>  differ  only  by 
a  constant  factor. 
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EXERCISES 

1.  Let/(a:),  <f>(x),  \f/(oc)  be  polynomials  in  a:  whose  coefficients  lie  in  a  certain 
domain  of  rationality.  Then  if  ij/  is  irreducible  in  this  domain  and  is  a  factor  of 
the  product /<^,  prove  that  if/  is  a,  factor  of /or  of  <f>. 

2.  Letf(x)  be  a  polynomial  in  x,  which  is  not  identically  zero,  and  whose 
coefficients  lie  in  a  certain  domain  of  rationality.  Prove  that /can  be  resolved 
into  a  product  of  polynomials  whose  coefficients  lie  in  this  domain,  which  are 
irreducible  in  this  domain,  and  no  one  of  which  is  a  constant,  in  one  and  essen- 
tially in  only  one  way. 

3.  Extend  the  results  of  this  section  to  polynomials  in  two  variables  whose 
coefficients  lie  in  a  certain  domain  of  rationality. 

76.  Factors  of  Polynomials  in  Three  or  More  Variables.  The  re- 
sults so  far  obtained  in  this  chapter  may  be  extended  to  polynomials 
in  three  variables  without,  in  the  main,  essentially  modifying  the 
methods  already  used.  We  proceed  therefore  to  state  the  theorems 
in  the  order  in  which  they  should  be  proved,  leaving  the  proofs  of 
most  of  them  to  the  reader.  The  extension  to  the  case  of  n  variables 
then  presents  no  difficulty,  and  is  left  entirely  to  the  reader  (cf . 
Exercise  1). 

Let /(a;,  y,  z)  be  any  polynomial  in  three  variables,  and  suppose  it 
arranged  according  to  powers  of  x, 

f(x,  y,  z)  =  a^{y,  z)x''  +  a^{y,  z)x''-^  +  •  •  •  4-  aj^y,  z), 

the  a's  being  polynomials  in  (?/,  z). 

Corresponding  to  Theorems  1,  2  of  §  73  we  have 

Theorem  1.  A  necessary  and  sufficient  condition  that  a  polynomial 
in  (^,  z)  he  a  factor  of  f  is  that  it  he  a  factor  of  all  the  a's. 

Theorem  2.  If  f{x^  3/,  2)  and  (f>{x,  y,  z)  are  any  two  polyno- 
mials in  (a;,  ^,  z)  and  yjr{y,  z)  is  an  irreducihle  polynomial  in  (3/,  z)only 
which  is  a  factor  of  the  product  f(f)^  then  yjr  is  a  factor  offor  of  (f>. 

Corollary.  Let  f{x^  3/,  z)  and  <^{x^  y,  z)  he  polynomials  in 
(x,  y,  s),  and  let  yjr{y^  z)  he  a  polynomial  in  (y,  z)  alone.  If  s^  is  a 
factor  of  the  product  of  f<^^hut  is  relatively  prime  to  </>,  then  it  is  a  fac- 
tor of  f 

To  find  the  greatest  common  divisor  of  two  polynomials  in  three 
variables  we  proceed  exactly  as  in  the  case  of  two  variables,  getting 
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a  set  of  identities  similar  to  (1),  §  74,  the  P's  and  Mp+i  being  now 
functions  of  (?/,  z),  while  the  other  ^'s  and  the  ^'s  are  functions  of 
(x,  1/,  z).     Corresponding  to  Theorems  1,  2  of  §  74  we  now  have 

Theorem  3.  A  necessary  and  sufficient  condition  thatf{x^  y^  z\  and 
^(x^y^  z)  have  a  common  factor  which  involves  x  is  that  Rp_^^{]j^  z)  =  0. 

Theorem  4.  If  i2p+i(y,  2;)=0,  the  greatest  common  divisor  of 
f(x,  y,  z)  and  <f)(x^  y^  z')  which  contains  no  factor  in  (?/,  z)  alone  is 
the  polynomial  G-(x^  y^  z)  obtained  by  striking  out  from  Rp{x^  y^  z) 
all  factors  in  (?/,  z)  alone. 

From  the  algorithm  of  the  greatest  common  divisor  for  the  two 
polynomials /(a;,  ?/,  2),  <^{x^y^z)  we  also  deduce  the  identity 

(1)  Rp+i{y,  2)  =  F{x,  y,  z)  f{x,  y,z)  +  ^  {x,  y,  z)4>(x,  y,  z), 

similar  to  (4),  §  74. 

Corresponding  to  Theorems  1,  2  of  §  75  we  have 

Theorem  5.  Jff^  f(x^  y^  z)  and  (f){x,  y,  z)  are  any  two  polynomials 
and'\jr(x^  y^  z)is  an  irreducible  polynomial  which  is  a  factor  of  the  prod- 
uct fcj)^  then  yjr  is  a  factor  of  f  or  of  (f>. 

Corollary.     If  the  product  of  any  number  of  polynomials 
f^{x,  y,  z)flx,  ^/,  z)  -■- fj,{x,y,z), 

is  divisible  by  an  irreducible  polynomial  yjr{x,  y,  2),  then  o/r  is  a  factor 
of  at  least  one  of  the  f's. 

Theorem  6.  A  polynomial  in  three  variables  which  is  not  identi- 
cally zero  can  be  resolved  into  the  product  of  irreducible  factors  no  one  of 
which  is  a  constant  in  one^  and  essentially  in  only  one.,  way. 

When  we  come  to  Theorem  3,  §  75,  however,  we  find  that  it 
does  not  admit  of  immediate  extension  to  the  case  of  three  vari- 
ables ;  for  Hp+iiy),  which  came  into  the  proof  of  that  theorem, 
becomes  now  ^p+j(?/,  ^),  and  we  can  no  longer  say  that  this  does  not 
vanish  at  an  infinite  number  of  points  (^,  z).  Not  only  is  the  proof 
thus  seen  to  fail,  but  the  obvious  extension  of  the  theorem  itself  is 
seen  to  be  false  when  we  recall  that  two  surfaces  intersect,  in  gen- 
eral, in  a  curve. 

This  theorem  may,  however,  be  replaced  by  the  following  one : 

Theorem  7.  Iff{x^  y^^)  ^^^  </>(^i  3/?  ^)  <^^^  ^^^  two  polynomials  in 
three  variables  of  which  (f)  is  irreducible^  and  if  f  vanishes  at  all  points 
{x,  y,  z)  at  which  (ft  vanishes.,  then  (f)  is  a  factor  off. 
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In  proving  this  theorem  we  may,  without  loss  of  generality, 
assume  that  </>  actually  contains  one  of  the  variables,  say  x\  for 
if  </)  contains  none  of  the  variables  rr,  y,  2,  the  theorem  is  trivial 
and  obviously  true. 

Suppose  </)  were  not  a  factor  of  /.  Then,  since  </>  is  irreducible, 
/  and  <^  are  relatively  prime.  Hence,  in  the  identity  (1)  above, 
i2p+i(«/,  z)^^.     Let  us  write 

(2)  <^{x,  y,  z)  =  b,(j/,  z)x-^  +  h^{y,  z)x-^-^  +  ...  +  h,,{y,  z)       (m>  1), 

where,  without  loss  of  generality,  we  may  assume  h^(i/^  z)  ^  0.     Then 

(3)  R,^^{y,z)h^{y,z)^0. 

Accordingly  we  can  find  a  point  (y^,  z-^  such  that 

(4)  ^P+i(yi.  ^1)^0,        \{y^.z^)^^. 

Consequently  <f>{x^  y^,  z-^  is  a  polynomial  in  x  alone  which  is  of  at 
least  the  first  degree,  and  which  therefore  (Theorem  1,  §  6)  vanishes 
for  some  value  x-^  of  x.     That  is 

Accordingly,  by  hypothesis. 

Referring  now  to  the  identity  (1),  we  see  that 

This,  however,  is  in  contradiction  with  (4).  Thus  our  theorem  is 
proved. 

If  to  each  part  of  a  reducible  algebraic  surface  we  attach  a  multi- 
plicity  in  precisely  the  same  way  as  was  explained  in  the  last  section 
for  plane  curves,  we  infer  at  once  the 

Corollary.  Iff  and  <f>  are  polynomials  in  (x^  y,  z)  neither  of  which 
is  identically  zero^  a  necessary  and  sufficient  condition  that  the  two  sur- 
faces y^O,  (/)=0 

he  identical  is  that  the  polynomials  f  and  </>  differ  only  by  a  constant 

factor. 

Theorem  7  admits  also  the  following  generalization  : 

Theorem  8.    Iff{x,  y,  z)  and  (j)(x,  y,  z)  are  any  two  polynomials  in 

three  variables  which  both  vanish  at  the  point  {x^^  y^,  2q)  and  of  which  (f) 

is  irreducible.,  and  if  in  the  neighborhood  N  of  (o^q,  y^,  z^^  f  vanishes  at 

all  points  at  which  (j)  vanishes^  then  <j>  is  a  factor  off. 
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We  assume,  as  before,  that  <f>  contains  x  and  can  therefore  be 
written  in  the  form  (2).  Let  us  first  consider  the  case  in  which 
5q(j/q,  Zq)  =^  0.  Here  the  proof  is  very  similar  to  the  proof  of 
Theorem  7. 

We  obtain  relation  (3)  precisely  as  above,  and  from  it  we  infer 
that  a  point  (y^,  z-^)  in  as  small  a  neighborhood  M  of  {y^^  z^)  as 
we  please  can  be  found  at  which  the  relations  (4)  are  true. 

Now  consider  the  equation 

(5)  </)(2:,  ^1,  ^i)=0. 

By  writing  <^  in  the  form  (2),  we  see  that  by  taking  the  neighbor- 
hood M  oi  (^Q,  z^  sufficiently  small,  we  can  make  the  coefficients 
of  (5)  differ  from  the  coefficients  of 

(6)  ii>{x,  y^,  z^)=  0 

by  as  little  as  we  please  (cf.  Theorem  3,  §  5).  Now  Xq  is  by  hy- 
pothesis a  root  of  equation  (6).  Consequently  by  taking  M  suffi- 
ciently small,  we  can  cause  (5)  to  have  at  least  one  root  x^  which 
differs  from  Xq  by  as  little  as  we  please  (cf.  Theorem  4,  §  6).  Thus 
we  see  that  a  point  (x^  y^,  z-^)  in  the  given  neighborhood  iV  of 
(^0'  ^0'  ^o)  ^^^  ^^  found  at  which 

Accordingly,  by  hypothesis. 

From  the  identity  (1)  we  have  then 

which  is  in  contradiction  with  (4).  Thus  our  theorem  is  proved  on 
the  supposition  that  ^oC^o'  ^o)  =^  ^'  * 

*  The  proof  just  given  will,  in  fact,  apply  to  the  case  in  which  not  all  the  5's  in  (2) 
vanish  at  the  point  (yo,  ^o)?  if  we  use  the  extension  of  Theorem  4,  §  6,  which  is  there 
mentioned  in  a  footnote.  It  is  only  the  extreme  case  in  which  all  the  6's  vanish  at  this 
point  which  requires  the  special  treatment  which  we  now  proceed  to  give.  The  reader 
is  advised  to  consider  the  geometrical  meaning  of  this  extreme  case. 
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In  order  to  treat  the  case  in  which  ^o(^o'^o)=  ^'  ^^^  ^^  denote  by 
h  the  degree  of  </>(a7,  ^,  s),  and  let  us  subject  this  polynomial  to  a 
non-singular  linear  transformation 

(7)  ^  y=a^x'  +  I3^y'  -\-r^^z' 

z  =  ag^;'  4-  ^^y'  +  732' 

which  makes  the  degree  of  <^  in  a;  equal  to  the  total  degree  k  oi  (f) 
(cf.    Theorem  2,  §64). 

Suppose  that  this  transformation  carries  over  the  point  (^o'^c'^o) 
into  the  point  (rr^,  y^,  z^).  Then  it  is  possible,  since  (7)  is  non- 
singular,  to  take  such  a  small  neighborhood  N'  of  (a;^,  3/^,  z'^)  that 
all  points  in  this  neighborhood  correspond  to  points  in  the  given 
neighborhood  iVof   (x^^  y^^  z^. 

Moreover,  by  means  of   (7),  <^  has  gone  over  into 

(8)  <i,'{x\  y',  z')  ^  blx'"  +  h[{y ',  a')^''"'  +  -  +  K{y',  2'). 

where  h[  is  a  constant  different  from  zero.  Let  us  denote  by 
f{x\  y\  z')  the  polynomial  into  which  /  is  transformed.  Then  it 
is  clear  that,  since,  in  the  neighborhood  iV,/  vanishes  whenever  (j> 
does,  in  the  neighborhood  iV'  (which  corresponds  to  a  part  of  iV),/' 
vanishes  whenever  c^^  does.  Accordingly  we  can  apply  the  part  of 
the  theorem  already  proved  to  the  two  polynomials  f  and  </)^  since 
the  first  coeihcient  of  <^'  in  the  form  (8),  being  a  constant  different 
from  zero,  does  not  vanish  at  (yj,  4)-     ^^  infer  that  <^'  is  a  factor 

°^-^''  f{x',  y\  z')  =  4,'{x',  y',  z')^'{x>,  y' ,  z'). 

If  here  we  replace  x' ^  ?/^  z^  by  their  values  in  terms  of  x^  y,  z  from 
(7),  we  see  that  ^  is  a  factor  of  / ;  and  our  theorem  is  proved. 

EXERCISES 

1.  State  and  prove  the  eight  theorems  of  this  section  for  the  case  of  polyno- 
mials in  n  variables. 

2.  Extend  the  result  of  the  exercise  at  the  the  end  of  §73  to  the  case  of 
polynomials  in  n  variables. 

3.  Extend  the  results  of  the  two  preceding  exercises  to  the  case  in  which  we 
consider  only  polynomials  whose  coefficients  lie  in  a  certain  domain  of  ration- 
ality. 
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4.    The  resultant  of  two  polynomials  in  one  variable 

f{x)  =  ciQX'^-]-  OjX^-^H h  On, 

<^   {X)  =\xr^+  \x'^-^+  •••  +  6m, 

is  sometimes  defined  as  the  polynomial  it  in  the  a's  and  &'s  of  lowest  degree  which 
satisfies  an  identity  of  the  form        „/.      j.  ,      -r^ 

where  F  and  <E>  are  polynomials  in  (oq,  •••  Onl  ^o'  "'  ^n»'  ^)'  ^^^  *^®  identity  is  an 
identity  in  all  these  arguments.  Prove  that  the  resultant  as  thus  defined  differs 
only  by  a  constant  factor  different  from  zero  from  the  resultant  as  we  defined  it 
in  §  68. 


CHAPTER   XVII 

GENERAL  THEOREMS  ON  INTEGRAL  RATIONAL 

INVARIANTS 

77.  The  Invariance  of  the  Factors  of  Invariants.  Let  us  con- 
sider the  general  w-ary  form  of  the  kth.  degree  which  we  will  rep- 
resent by  f{x-^->  •••  ^n  ;  ^11  ^2»  •••)»  ^^^®  ^'^  being  the  variables  and  the 
a's  the  coefficients.  By  suitably  changing  the  a's,  this  symbol  may 
be  used  to  represent  an]/  such  form.  Hence,  if  we  subject  such  a 
form  to  a  linear  transformation,  the  new  form,  being  n-ary  and  of 
the  same  degree  as  the  old,  may  be  represented  by  the  same  func- 
tional letter  :  /(^J,  '"  ^ni  ^'v  ^2'  *")•  This  new  form  will  evidently 
be  homogeneous  and  linear  in  the  a's  ;  that  is,  each  of  the  a^'s  is  a 
homogeneous  linear  polynomial  in  the  a's.  It  is  also  clear  that  each 
of  the  a''s  is  a  homogeneous  polynomial  of  the  ^th  degree  in  the 
coefficients  of  the  transformation. 

It  follows  from  the  very  definition  of  invariants  that  if  we  have 
a  number  of  integral  rational  relative  invariants  of  a  form  or  system 
of  forms,  their  product  will  also  be  an  integral  rational  relative  in- 
variant. It  is  the  converse  of  this  that  we  wish  to  prove  in  this 
section.  We  begin  by  stating  this  converse  in  the  simple  case  of  a 
single  form. 

Theorem  1.  If  I{a^^  a^,  •••)  is  an  integral  rational  invariant  of 
the  n-ary  form  -pr  .  \ 

and  is  reducible^  then  all  its  factors  are  invariants. 

It  will  evidently  be  sufficient  to  prove  that  the  irreducible  factors 
of  I  are  invariants,     hetf-^^f^,  -"fi  be  the  irreducible  factors  of  L 
Subjecting /to  the  linear  transformation 


^n  —  ^nl*^l  ~r   •••  -T  ^nn^ni 
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whose  determinant  we  call  e,  and  denoting  the  coefficients  of  the 
transformed  form  by  a^,  ai^^  •  •  • ,  we  have 

(1)  I(a[,  a^,  •••)  =  e^I{a^,  a^,  .••); 
an  identity  which  may  also  be  written 

We  have  here  a  polynomial  in  the  c's  and  a's  which,  on  the 
second  side  of  the  identity,  is  resolved  into  its  irreducible  factors, 
since  by  Theorem  1,  §  61,  the  determinant  e  is  irreducible.  Hence 
each  factor  on  the  first  side  is  equal  to  the  product  of  some  of  the 
factors  on  the  second.     That  is 

(2)  //aj,  4'  •  •  •)  =  (^^'H^v  H^'")  (**  =  1,  2,  . . .  Z) 
where  the  </)'s  are  polynomials. 

Now  let  _       _       _       _  1 

and  let  all  the  other  <?'s  be  zero.  Our  transformation  becomes  the 
identical  transformation,  the  determinant  c  =  1,  and  each  a^  is  equal 
to  the  corresponding  a.     The  identities  (2)  therefore  reduce  to 

fi{a^,  ^2'  •••)=  H.^v  ^2'  •••)  (2  =  1,  2, ...  I). 

Substituting  this  value  of  c^^  in  (2),  we  see  that  /»•  is  an  invariant, 
and  our  theorem  is  proved. 

The  general  theorem,  now,  is  the  following  : 

Theorem  2.    If  I  (a^,  a^,  "•  ;hpb^^  '";'•-)  is  an  integral  rational 
invariant  of  the  system  of  forms 


and  is  reducible^  then  all  its  factors  are  invariants. 

The  proof  of   this   theorem  is  practically  identical  with  that  of 
Theorem  1. 
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EXERCISE 

If  /(ai, 02,  •••;  &i, ^2)  •••;••• ;  yh"'yn'i  2i>--*2;„;  •••)  is  an  integral  rational  co- 
variant  of  the  system  of  forms 

/2(Xi,  —Xn;    bl,b2,'-)y 


and  the  system  of  points  (^i,  •••^/n)^  {^h 
factors  are  co variants  (or  invariants). 


Zn),"-,  and  is  reducible,  then  all  its 


78.  A  More  General  Method  of  Approach  to  the  Subject  of  Rela- 
tive Invariants.  We  have  called  a  polynomial  I  in  the  coefficients 
of  an  92-ary  form  /  a  relative  invariant  of  this  form  if  it  has  the 
property  of  being  merely  multiplied  by  a  power  of  the  determinant 
of  the  transformation  when /is  subjected  to  a  linear  transformation. 
It  is  natural  to  inquire  what  class  of  functions  Zwe  should  obtain  if 
we  make  the  less  specific  demand  that  /  be  multiplied  by  a  poly- 
nomial in  the  coefficients  of  the  transformation.  We  should  expect 
to  get  in  this  way  a  class  of  functions  more  general  than  the  invari- 
ants we  have  so  far  considered.  As  a  matter  of  fact,  we  get  precisely 
the  same  class  of  functions,  as  is  shown  by  the  following  theorem : 

Theorem.  Let  I  he  a  polynomial  not  identically  zero  in  the  co- 
efficients (^j,  a^^  •••)  of  an  n-ary  form  /,  and  let  (aJ-^^a'^^  •••)  he  the  co- 
efficients of  the  form  ohtained  hy  subjecting  f  to  the  linear  transformation 


X-^  —  ^11*^1  ~r  *  *  *    I    ^\rv^ 


If  I{a\,a'^,  •-')  =  yjr{c^^,"-c^n)I{a^,a^,'"\ 

where  yfr  is  a  polynomial  in  the  c's,  and  this  is  an  identity  in  the  a's  and 
c's^  then  sir  is  a  power  of  the  determinant  of  the  transformation. 

We  will  first  show  that  yjr^O  when  c=^0.    If  possible  let  d-^^^  •  •  •  dnn 
be  a  particular  set  of  values  of  the  c^/s  such  that 


while 


^11  '**  ^i« 


^ni  ' ' '  ^nn 


^0. 
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Then  the  transformation 


has  an  inverse 


X-^  =  o-|^^a:^j^  +  •  •  •  4"  ^inP^w 


Let  us  consider  a  special  set  of  a's  such  that  /(a^,  a^',  •••)=?£=  0.    Then 

Now  apply  the  inverse  transformation,  and  we  have 

which  is  contrary  to  our  hypothesis. 

Having  thus  proved  that  '^  can  vanish  only  when  c  =  0,  let  us 
break  up  i/r  into  its  irreducible  factors, 

Since  yfr  vanishes  whenever  yjti  =  0,  i/r^-  can  vanish  only  when  <?  =  0. 
Hence  by  the  theorem  for  n  variables  which  corresponds  to  Theorem  7, 
§  76,  yJTi  must  be  a  factor  of  c.  But  c  is  irreducible.  Hence  yjr^  can 
differ  from  e  only  by  a  constant  factor,  and  we  may  write 

It  remains  then  merely  to  prove  that  the  constant  -BT  has  the  value  1. 
For  this  purpose  consider  the  identity 

and  give  to  the  e^'s  the  values  which  they  have  in  the  identical 
transformation.  Then  c  =  1,  and  the  a^'s  are  equal  to  the  correspond- 
ing a's.     The  last  written  identity  therefore  becomes 

from  which  we  infer  that  ^=  1. 
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EXERCISES 

1.  Prove  that  if  a  polynomial  /  in  the  coefficients  ai,  02,  •••  of  an  n-ary  form  and 
the  coordinates  (?/i,  •••?/„)  of  a  point  has  the  property  of  being  merely  multiplied 
by  a  certain  rational  function  if/  of  the  coefficients  of  the  transformation  when  the 
form  and  the  point  are  subjected  to  a  linear  transformation,  then  i//  is  a  positive  or 
negative  power  of  the  determinant  of  the  transformation,  and  7  is  a  covariant. 

2.  Generalize  the  theorem  of  this  section  to  the  case  of  invariants  of  a  system 
of  forms. 

3.  Generalize  the  theorem  of  Exercise  1  to  the  case  of  a  system  of  forms  and 
a  system  of  points. 

4.  Prove  that  every  rational  invariant  of  a  form  or  system  of  forms  is  the 
ratio  of  two  integral  rational  invariants. 

5.  Generalize  the  theorem  of  Exercise  4  to  the  case  of  covariants. 

79.    The   Isobaric   Character   of  Invariants   and   Covariants.     In 

many  investigations,  and  in  particular  in  the  studj^  of  invariants  and 
covariants,  it  is  desirable  to  attach  a  definite  weight  to  each  of  the 
variables  with  which  we  have  to  deal.  To  a  product  of  two  or  more 
such  variables  we  then  attach  a  weight  equal  to  the  sum  of  the 
weights  of  the  factors,  and  this  weight  is  supposed  to  remain 
unchanged  if  the  product  is  multiplied  by  a  constant  coefficient. 
Thus  if  2 J,  ^2^  ^3  ^re  regarded  as  having  weights  w^^  w^^  w^  respectively, 
the  term  r  o 

would  have  the  weight         w^  -f  w^  +  2w^. 


If,  then,  having  thus  attached  a  definite  weight  to  each  of  the  vari- 
ables, we  consider  a  polynomial,  each  term  of  this  polynomial  will  be 
of  a  definite  weight,  and  hy  the  weight  of  a  polynomial  we  understand 
the  greatest  weight  of  any  of  its  terms  whose  coefficient  is  not  zero.  If 
moreover  all  the  terms  of  a  polynomial  are  of  the  same  weight,  the 
polynomial  is  said  to  be  isobaric. 

It  may  be  noticed  that,  according  to  this  definition,  a  polynomial 
which  vanishes  identically  is  the  only  one  which  has  no  weight,  while 
a  polynomial  which  reduces  to  a  constant  different  from  zero  is  of 
weight  zero.  Moreover  if  two  polynomials  are  of  weights  w-^^  and  w^^^ 
their  product  is  of  weight  w-^  +  w^.* 

*  The  conception  of  degree  of  a  polynomial  is  merely  the  special  case  of  the  con- 
ception of  weight  in  which  all  the  variables  are  supposed  to  have  weight  1.  The  con- 
ception of  being  isobaric  then  reduces  to  the  conception  of  homogeneity. 
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We  will  apply  this  conception  of  weight  first  to  the  case  in  which 
the  variables  of  which  we  have  been  speaking  are  the  coefficients 
ttp  a^^  "'  of  the  n-sivj  form 

We  shall  find  it  desirable  to  admit  n  different  determinations  of  the 
weights  of  these  a's ;  one  determination  corresponding  to  each  of  the 
variables  a:^,  •  •  •  x^- 

Definition  1.     ^  a^  is  the  coefficient  of  the  term 

in  an  n-ary  form^  we  assign  to  a^  the  weights  Pi-^Pi'*'" Pn  respectively 
with  regard  to  the  variables  x-^,  x^^  •••  x^- 

In  the  case  of  a.  binary  form, 

the  subscripts  of  the  coefficients  indicate  their  weights  with  regard 
to  x^,  while  their  weights  with  regard  to  x^  are  equal  to  the  differences 
between  the  degree  of  the  form  and  these  subscripts. 
As  a  second  example,  we  mention  the  quadratic  form 

n 

z^a^jX^Xj. 

Here  the  weight  of  any  coefficient  with  regard  to  one  of  the  vari- 
ables, say  Xj^  is  equal  to  the  number  of  times  j  occurs  as  a  subscript 
to  this  coefficient.* 

In  connection  with  this  subject  of  weight,  the  special  linear 
transformation 

(1)  ^  7    f  ^ 

is  useful.  If  a^  is  a  coefficient  which  is  of  weight  X  with  regard  to 
Oy,  the  term  in  which  this  coefficient  occurs  contains  x^^  and  therefore 

cb^  ^^^  iC  a^t 

*  For  forms  of  higher  degree,  a  similar  notation  for  the  coefficients  by  means  of 
multiple  subscripts  might  be  used.  The  weight  of  each  coefficient  could  then  be  at 
once  read  off  from  the  subscripts. 
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That  is 

Theorem  1.  The  weight  with  regard  to  Xj  of  a  coefficient  of  an 
n-ary  form  is  the  exponent  of  the  power  of  k  by  which  this  coefficient  is 
multiplied  after  the  special  transformation  (1). 

From  this  it  follows  at  once  that  an  isobaric  polynomial  of 
weight  X  with  regard  to  Xj  in  the  coefficients  (a^  a^^  •••)  of  an  ^-ary 
form  is  simply  multiplied  by  k^  if  the  form  is  subjected  to  the  linear 
transformation  (1). 

Moreover,  the  converse  of  this  is  also  true.  For  if  aj,  a'^^---  are 
the  coefficients  of  the  n-ary  form  after  the  transformation  (1),  and  if 
</>(«!,  ^2'  •••)^^  ^  polynomial  which  has  the  property  that 

</)(ai,  a^,  •••)  =  k^(t>{a^,  ^2'  •••)' 

this  being  an  identity  in  the  a's  and  also  in  k,  we  can  infer,  as  fol- 
lows, that  </>  is  isobaric  of  weight  \.  Let  us  group  the  terms  of  <^ 
together  according  to  their  weights,  thus  writing  <^  in  the  form 

(j)(a^,a^,  •")  =  (l)^(a^,a^,  ■  ■ -) -\- ^^{a^,  a^,  •••)+  *•• 

where  c^j,  c^g^  •  •  •  are  isobaric  of  weights  \j,  X^^  •  •  • .     We  have  then 

<l){a[,a'^,  '")=k^^(f)^{a^,a^,  ...)  + ^^2</)2(a|,  ^g^  •••)-f  •••  . 

But  on  the  other  hand 

(}>{a[,a'^,  '•')  =  k^<t>{a^,  a^,  ^ -)  =  k^(j)^{a^,  a^,  •••)  + ^^(^2(^1' ^2'  •••)^ • 

Comparing  the  last  members  of  these  two  identities,  we  see  that 

A.  ^  A-t  =  An  ^    *  *  * 

as  was  to  be  proved.     We  have  thus  established  the  theorem: 

Theorem  2.  A  necessary  and  sufficient  condition  that  a  poly- 
nomial (f)  in  the  coefficients  of  an  n-ary  form  he  simply  multiplied  by  k^ 
when  the  form  is  subjected  to  transformations  of  the  form  (1)  is  that  </> 
be  isobaric  of  weight  X  with  regard  to  Xj. 

By  means  of  this  theorem  we  can  show  that  the  use  of  the  word 
weight  introduced  in  §31  is  in  accord  with  the  definition  given  in 
the  present  section.  For  an  integral  rational  invariant  of  an  n-ary 
form  which,  according  to  the  definition  of  §31,  is  of  weight  X  will,  if 
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the  form  is  subjected  to  the  transformation  (1),  be  merely  multiplied 
by  k^  and  must  therefore,  according  to  Theorem  2,  be  isobaric  of 
weight  \  with  regard  to  Xj,     That  is: 

Theorem  3.  If  I  is  an  integral  rational  invariant  of  a  form  f 
ivJiich  according  to  the  definition  of  §  31  is  of  weight  X,  it  will  also  he  of 
weight  \  with  regard  to  each  of  the  variables  Xj  of  f  according  to  the 
definitions  of  this  section^  and  it  will  he  isobaric  with  regard  to  each  of 
these  variables. 

As  an  illustration  of  this  theorem  we  may  mention  the  discrimi- 
nant ^0^2-^2 

of  the  binary  quadratic  form 

A      1    "•       ^a-t  X-i  Xn   "l       anXn 

which  is  isobaric  of  weight  2  both  with  regard  to  x^  and  with  regard 
to  x^. 

The  reader  should  consider  in  the  same  way  the  discriminant  of 
the  general  quadratic  form. 

All  of  the  considerations  of  the  present  section  may  be  extended 
immediately  to  the  case  in  which  we  have  to  deal,  not  with  a  single 
form,  but  with  a  system  of  forms.  We  state  here  merely  the 
theorem  which  corresponds  to  Theorem  3. 

Theorem  4.     If  lis  an  integral  rational  invariant  of  a  system  of 
forms  which  according  to  the  definition  of  §  31  is  of  weight  \,  it  will  also 
he  of  iveight  \  with  regard  to  each  of  the  variables  Xj  of  the  system,  and 
it  ivill  he  isobaric  with  regard  to  each  of  these  variables. 

The  reader  may  consider  as  an  illustration  of  this  theorem  the 
resultant  of  a  system  of  linear  forms,  and  also  the  invariants  obtained 
in  Chapters  XII  and  XIII. 

We  saw  in  Theorem  5,  §  31,  that  the  weight  of  an  integral  rational 
invariant  cannot  be  negative.  This  fact  now  becomes  still  more 
evident,  since  the  weight  of  no  coefficient  is  negative.  Moreover, 
we  can  now  add  the  following  further  fact: 

Theorem  5.  An  integral  rational  invariant  of  a  form  or  system 
of  forms  cannot  he  of  weight  zero. 

For  consider  any  term  of  the  invariant  whose  coefficient  is  not 
zero.  This  term  involves  the  product  of  a  number  of  coefficients  of 
the  system  of  forms.    Since  none  of  these  coefficients  can  be  of  nega- 

Q 
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tive  weight,  the  weight  of  the  term  will  be  at  least  as  great  as  the 
weight  of  any  one  of  them.  But  any  one  of  them  is  at  least  of 
weight  1  with  regard  to  some  one  of  the  variables.  Hence  the  in- 
variant is  at  least  of  weight  1  with  regard  to  some  one  of  the 
variables,  and  hence  with  regard  to  any  of  the  variables. 

In  order,  finally,  to  be  able  to  extend  the  considerations  of  this 
section  to  the  case  of  covariants,  we  must  lay  down  the  following 
additional  definition: 

Definition  2.  If  the  sets  of  variables  (^/l,  •••  ?/^),  {zi^  •••  ^„),  ••• 
are  cogredient  with  the  variables  (ir^,  •••  x^^  of  a  system  of  n-ary  forms^ 
we  will  assign  to  yj^  Zp  •  •  •  the  weight  —  1  with  regard  to  Xj^  to  all  the 
other  y's,  z's^  etc.  the  weight  0. 

It  will  be  noticed  that  here  too,  when  we  perform  the  transform- 
ation (1),  each  of  the  variables  is  multiplied  by  a  power  of  k  whose 
exponent  is  the  weight  of  the  variable.  It  is  therefore  easy*  to 
extend  the  considerations  of  this  section  to  this  case,  and  we  thus 
get  the  theorem : 

Theorem  6.  ^  /  is  an  integral  rational  covariant  of  a  system 
of  forms  and  a  system  of  points  which  is  of  weight  X  according  to  the 
definition  of  §31,  it  will  also  be  of  weight  \  with  regard  to  each  of  the 
variables  of  the  system^  and  it  will  be  isobaric  with  regard  to  each  of 
these  variables. 

As  an  example  of  this  theorem  we  note  that  the  polar 

S^l^l  +  ^l(j/l^2  +  3/2^1)  +  ^2^2^2 

of  a  binary  quadratic  form  is  isobaric  of  weight  zero.  The  reader 
may  satisfy  himself  that  the  same  is  true  of  the  polar  of  the  general 
quadratic  form. 

80.    Geometric  Properties  and  the  Principle  of  Homogeneity.     It 

is  a  familiar  fact  that  many  geometric  properties  of  plane  curves  or 
surfaces  are  expressed  by  the  vanishing  of  an  integral  rational  func- 
tion of  the  coefficients  of  their  equations.     Take,  for  instance,  the 
surface 
(1)  f{x,  y,  2;  «i,  ^2,  •••)  =  ^^ 

*  Slight  additional  care  must  be  taken  here  on  account  of  the  possible  presence  of 
terms  of  negative  weight. 
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where  /  is  a  poljniomial  of  the  ^th  degree  in  the  non-homogeneous 
coordinates  x,  y,  z,  and  a^,  a^,"-  are  the  coefficients  of  this  polyno- 
mial ;  and  consider  the  relation 

(2)  4>{a^,  a^,'")  =  0, 

where  0  is  a  polynomial,  which  we  will  assume  to  be  of  at  least 
the  first  degree,  in  the  coefficients  ^^  a^,  "•  *  By  Theorem  3,  §  6, 
there  are  an  infinite  number  of  polynomials  of  the  hth.  degree  in 
(x,  y,  z)  whose  coefficients  satisfy  the  relation  (2)  and  also  an  infinite 
number  whose  coefficients  do  not  satisfy  this  relation.  In  other 
words,  all  polynomials  of  the  kt\\  degree  in  (a;,  y,  z)  may  be  divided 
into  two  classes,  A  and  B^  of  which  the  first  is  characterized  by  con- 
dition (2)  being  fulfilled,  while  the  second  is  characterized  by  this 
condition  not  being  fulfilled.  We  may  therefore  say  that  (2)  is  a 
necessary  and  sufficient  condition  that  /  have  a  certain  property^ 
namely,  the  property  of  belonging  in  class  A. 

The  simplest  examples,  however,  show  that  this  property  of  / 
need  not  correspond  to  a  geometric  property  of  the  surface  (1). 
To  illustrate  this,  let  ^  =  1,  so^that  we  have 

f=a^x  +  a^y-\-a^z  +  a^,   . 
and  consider  first  the  polynomial  in  the  a's  : 

The  vanishing  of  cf)  gives  a  necessary  and  sufficient  condition  that  / 
belong  to  the  class  of  homogeneous  polynomials  of  the  first  degree 
in  (a;,  ?/,  2),  and  thus  expresses  a  property  of  the  polynomial.  This 
same  condition,  a^  =  0^  also  expresses  a  property  of  the  plane /=  0, 
namely,  the  property  that  it  pass  through  the  origin. 

Suppose,  however,  that  instead  of  the  function  <j)  we  take  the 
polynomial  ,   _  -1 

The  vanishing  of  this  polynomial  also  gives  a  necessary  and  suffi- 
cient condition  that  the  polynomial  /  have  a  certain  property,  namely, 
that  its  constant  term  have  the  value  1.  It  does  not  serve  to  dis- 
tinguish planes  into  two  classes,  since  we  may  write  the  equation  of 
any  plane  (except  those  through  the  origin)  either  with  the  constant 
term  1  or  with  the  constant  term  different  from  1  by  merely  multi- 
plying the  equation  through  by  a  constant. 
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From  the  foregoing  it  will  be  seen  that  saying  that  a  surface  has 
a  certain  property  amounts  to  the  same  thing  as  saying  that  it 
belongs  to  a  certain  class  of  surfaces.* 

Theorem  1.  The  equation  (2)  expresses  a  necessary  and  sufficient 
condition  for  a  geometric  property  of  the  surface  (1)  when^  and  only 
when^  the  polynomial  </>  is  homogeneous. 

For  if  </)  is  non-homogeneous,  let  us  write  it  in  the  form 

where  <f>n  is  a  homogeneous  polynomial  of  the  ni\i  degree  and  each 
of  the  other  </)'s  which  is  not  identically  zero  is  a  homogeneous  poly- 
nomial of  the  degree  indicated  by  its  subscript.  Let  a^,  ag?  •••  be  a 
set  of  values  of  the  a's  for  which  </>„  and  at  least  one  of  the  other  <^/s 
is  not  zero,  and  consider  the  surface 

(3)  f{x,  y,  z\  ca[,  ca\,  ■..)=0. 

The  condition  (2)  for  this  surface  is 

+  </>o«  4'  •••)  =  ^- 

This  is  an  equation  of  the  wth  degree  in  c,  and  since  at  least  one 
of  the  coefficients  after  the  first  is  different  from  zero,  it  will  have 
at  least  one  root  c^  4^  0.  On  the  other  hand,  we  can  find  a  value 
^2  ^fc  0  which  is  not  a  root  of  this  equation.  Hence  the  surface  (3) 
satisfies  condition  (2)  if  we  let  c  =c^  and  does  not  satisfy  it  if  <?  =  c^. 
But  a  change  in  the  value  of  c  merely  multiplies  the  equation  (3) 
by  a  constant  and  does  not  change  the  surface  represented  by  it. 
Thus  we  see  that  one  and  the  same  surface  can  be  regarded  both  as 
satisfying  and  as  not  satisfying  condition  (2).  In  other  words,  if  </> 
is  non-homogeneous,  (2)  does  not  express  a  property  of  the  surface  (1). 
Assume  now  that  </>  is  homogeneous  of  the  n\h  degree,  and 
consider  the  class  A  of  polynomials  /  whose  coefficients  satisfy  equa- 
tion (2)  and  the  class  B  whose  coefficients  do  not  satisfy  this  equa- 
tion. Our  theorem  will  be  proved  if  we  can  show  that  we  have 
hereby  divided   the  surfaces  (1)  into  two  classes,  that  is,  that  if 

*  This  brief  explanation  must  not  be  regarded  as  an  attempt  to  define  the  concep- 
tion property^  for  no  specific  class  can  be  defined  without  the  use  of  some  property. 
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«!,  ^2^  •••  ^1*6  the  coefficients  of  a  polynomial  of  class  A  and  a'l,  ai^^  ••. 
the  coefficients  of  a  polynomial  of  class  B^  then  the  two  surfaces 

fix,  y,  2;  a![,  al,  •••)  =  0, 

cannot  be  the  same.     If  they  were  the  same,  the  coefficients  aj,  «2'  *** 
would  be  proportional  to  a![,  a^,  •••  (cf.  Theorem  7,  Corollary,  §  76), 

and  therefore  <^(^p  «2'  *••)"  ^"^^C^^i^  ^2'  "*)• 

But  this  is  impossible  since  by  hypothesis 

Thus  our  theorem  is  proved. 

This  theorem  admits  of  generalization  in  various  directions. 
Suppose  first  that  instead  of  a  single  surface  (1)  we  have  a  system 
of  algebraic  surfaces,  and  that  <^  is  a  polynomial  in  the  coefficients  of 
all  these  surfaces.  Then  precisely  the  reasoning  just  used  shows  that 
the  equation  <^  =  0  gives  a  necessary  and  sufficient  condition  for  a 
geometric  property  of  this  system  of  surfaces  when  and  only  when  (j) 
is  homogeneous  in  the  coefficients  of  each  surface  taken  separately. 

On  the  other  hand,  we  may  use  homogeneous  coordinates  in 
writing  the  equations  of  the  surfaces,  and  the  results  so  far  stated 
will  obviously  hold  without  change: 

Theorem  2.   Let 

f^{x,  y,  z,  t\  a^,  ^2'  •••)'  f%{^'>  y^  ^'  ^'  ^1'  ^2'  •••)'  ••• 
he  a  system  of  homogeneous  polynomials  in  {x,  y,  2,  ^)  whose  coefficients 
are  a^  a^,  •  •  • ;  ^j^,  h^,  •  •  • ;  etc, ;  and  let 

(j>{a^,  a^,  •••  ;   5^  ^2^  ••• ;  •••) 

be  a  polynomial  in  the  as,  h's,  etc.      Then  the  equation  </>  =  0  expresses 
a  necessary  and  sufficient  condition  that  the  system  of  surfaces 

/i=0,/2  =  0,  ... 

have  a  geometric  property  when,  and  only  when,  the  polynomial  (f>  is 
homogeneous  in  the  a's  alone,  also  in  the  Fs  alone,  etc. 

In  conclusion  we  note  that  all  the  results  of  this  section  can  be 
extended  at  once  to  algebraic  curves  in  the  plane  ;  or,  indeed,  to  the 
case  of  space  of  any  number  of  dimensions. 


230  INTRODUCTION    TO   HIGHER   ALGEBRA 

EXERCISE 

If,  in  Theorem  2,  besides  the  surfaces  yi  =  0,  /g  =  0,  •••  we  also  have  a  system 

and  if  <;^  is  a  polynomial  not  merely  of  the  a's,  6's,  etc.,  but  also  of  the  coordinates 
of  these  points,  prove  that  <f>  =  0  expresses  a  necessary  and  sufficient  condition 
that  this  system  of  surfaces  and  points  have  a  geometric  property  when  and 
only  when  ^  is  homogeneous  in  the  a's  alone,  in  the  6's  alone,  etc.,  and  also  in 
(xi,  yi,  2i,  ^i)  alone,  in  (x2,  y2,  22,  ^2)  alone,  etc. 

81.  Homogeneous  Invariants.  From  the  developments  of  the  last 
section  it  is  clear  that  the  only  integral  rational  invariants  which 
will  be  of  importance  in  geometrical  applications  are  those  which  are 
homogeneous  in  the  coefficients  of  each  of  the  ground-forms  taken 
separately.*  Such  invariants  we  will  speak  of  as  homogeneous  in- 
variants. It  will  be  found  that  all  the  invariants  which  we  have 
met  so  far  are  of  this  kind. 

An  important  relation  between  the  weight  and  the  various  de- 
grees connected  with  a  homogeneous  invariant  is  given  by  the  follow- 
ing theorem  : 

Theorem  1.     J^we  have  a  system  of  n-ary  forms^ 


(1) 


of  degrees  m^  mg,  •  •  •  respectively^  and  if 

I(a^,  a^,  •••  ;   5^,  h^,  •••;    •••) 

*  This  statement  must  not  be  taken  too  literally.  It  is  true  if  in  the  geometrical 
application  in  question  we  consider  the  variables  as  homogeneous  coordinates  and  if 
we  have  to  deal  with  the  loci  obtained  by  equating  the  ground-forms  to  zero.  While 
this  is  the  ordinary  way  in  which  we  interpret  invariants  geometrically,  other  inter- 
pretations are  possible.  For  instance,  instead  of  interpreting  the  variables  (oj,  y)  as 
homogeneous  coordinates  on  a  line  and  equating  the  binary  quadratic  forms 

/i  =  aix^  +  2  a^xy  +  asy^, 

f2=bix'^  +  2b2xy  +  b3y^ 
to  zero,  thus  getting  two  pairs  of  points  on  a  line,  we  may  interpret  (x,  y)  as  non- 
homogeneous  coordinates  in  the  plane,  and  consider  the  two  conies  /i  =  1,  /g  =  1.    With 
this  interpretation,  the  vanishing  of  the  invariant 

ai«3  —  «|  +  bibs  —  ?)|, 
which  is  not  homogeneous  in  the  a's  alone  or  in  tl;e  ?>'s  alone,  has  a  geometric  meaning 
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is  a  homogeneous  invariant  of  this  system^  of  weight  X,  and  of  degree  a 
in  the  a^s^  ^  in  the  5's,  etc.^  then 

(2)  m-^a -\- m^^ -{•  •••  =  nX, 

Subjecting  the  forms  (1)  to  the  linear  transformation 


(3) 


.  ^n  —  ^nV^'\    I      *  *  *      I    ^nn^n 


ni 


whose  determinant  we  will  denote  by  c,  we  get 

f  (t'    '••  t'  '    h'    h'    ...^ 


and,  since  by  hypothesis  /is^an  invariant  of  weight  \, 

(4)     7(ai,  4,  •♦.;   h[,h'^,  •.•;  •■■)  =  c^I {a^,  a^,  •••;   h^,h^,  ••.  ;  -.•). 

Every  a'  is  a  homogeneous  polynomial  in  the  c^^'s  of  degree  m^  every 
h'  of  degree  m^^  etc.  ;  and  since  /is  itself  homogeneous  of  degree  cc 
in  the  a's,  /3  in  the  J's,  etc.,  we  see  that  the  left-hand  side  of  (4)  is  a 
homogeneous  polynomial  of  degree  m-^a -\- m^l3 -\-  •••in  the  <?^y's. 
Equating  this  to  the  degree  of  the  right-hand  side  of  (4)  in  the  c.y's, 
which  is  evidently  n\  our  theorem  is  proved. 

An  additional  reason  for  the  importance  of  these  homogeneous 
invariants  is  that  the  non-homogeneous  integral  rational  invariants 
can  be  built  up  from  them,  as  is  stated  in  the  following  theorem: 

Theorem  2.     If  an  integral  rational  invariant  I  of  the  system  (1) 

he  written  in  the  form        ^     ^       ^  ^ 

•^  I=I^  +  I^+-+I, 

where  each  of  the  I-s  is  a  polynomial  in  the  a's^  6's,  etc.^  which  is 
homogeneous  in  the  a's  alone^  and  also  in  the  b's  alone^  etc.^  and  such 
that  the  sum  of  no  two  I^s  has  this  property,  then  each  of  the  functions 

is  a  homogeneous  invariant  of  the  system  (1). 
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This  theorem  follows  immediately  from  the  definition  of  an 
invariant.     For  from  the  identity, 

we  infer  at  once  the  identities, 

I^{a[,  4  ...  ;    5J,  b'^,  ...  ;  - • -)  =  c^I^{a^,  a^, ...  ;   5^,  h^,  ...  ;  ...), 

Ijla[,  a^,  ...  ;   h[,  h'^,  .••  ;  "')  =  €^IJ^a^,  a^,  ...  ;   5^,  b^,  ...  ;  ...). 

In  the  case  of  a  single  ?i-ary  form,  but  in  that  case  only,  we  have 
the  theorem : 

Theorem  3.  An  integral  rational  invariant  of  a  single  n-ary  form 
is  always  homogeneous. 

Let  f{x^,  •••^»;  %  «2'  •••) 

be  the  ground-form,  and  let  I  be  the  invariant.  By  Theorem  2  we 
may  write  j^  j^  +  j^+ ...  +  j^ 

where  Zj,  .-.  7^.  are  homogeneous  invariants.  Let  the  degrees  of  these 
homogeneous  invariants  in  the  a's  be  a^,  •••  aj^  respectively.  Their 
weights  are  all  the  same  as  the  weight  of  Z,  which  we  will  call  \, 
If,  then,  we  call  the  degree  of/,  m,  we  have,  by  Theorem  1, 

mai  =  n\  ma^  =  n\  ...  maj^  =  w\, 
from  which,  since  m  >  0,  we  infer 

That  is,  I^^ '"  I^  are  of  the  same  degree,  and  I  is  homogeneous. 

Theorem  4.  If  we  have  a  system  of  n-ary  forms  /j,  f^,  -"and 
a  polynomial  (f)  in  their  coefficients^  the  equation  </>  =  0  gives  a  necessary 
and  sufficient  condition  for  a  projective  property  of  the  system  of  loci  in 
space  of  n  —  1  dimensions^ 

/i  =  0,  /2  =  o, , 

when,  and  only  when,  </>  is  a  homogeneous  invariant  of  the  system  of 
forms  f. 
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If  </)  is  a  homogeneous  invariant,  its  vanishing  gives  a  necessary 
and  sufficient  condition  for  a  geometric  property  (cf.  §  80),  and  this 
property  must  be  a  projective  property  since  when  we  subject  the 
loci  to  a  non-singular  collineation,  </>  is  merely  multiplied  by  a  non- 
vanishing  constant. 

On  the  other  hand  let  <^  =  0  be  a  necessary  and  sufficient  condi- 
tion for  a  projective  property.  In  order  to  prove  that  (/>  is  an 
invariant  (it  must  be  homogeneous  by  §  80)  let  a^,  ^g,  •••  be  the 
coefficients  of  /-^;  5^,  h^,-"  the  coefficients  of  /g,  etc.;  and  suppose 
that  the  linear  transformation, 

(5) 


carries  over  f^  into  f[  with  coefficients  «J,  a'^---  \  f^  into/g  with  coeffi- 
cients 5J,  ^2'  "•  J  ^^^*  "^h^  polynomial  0  formed  for  the  transformed 
forms  is  ,     <^(«i,  4,-;  h[,V^,-.;  ••.), 

and  may,  since  the  a''s,  5''s,  •••  are  polynomials  in  the  a's,  6's,  •••  and 
the  c's,  be  itself  regarded  as  a  polynomial  in  the  a's,  5's,  •  •  •  and  the 
(?'s.  Looking  at  it  from  this  point  of  view,  let  us  resolve  it  into  its 
irreducible  factors. 

It  is  clear  that  at  least  one  of  the  factors  on  the  right  must  con- 
tain the  es.  Let  c^^  be  such  a  factor,  and  let  us  arrange  it  as  a  poly- 
nomial in  the  c's  whose  coefficients  are  polynomials  in  the  <i's,  5's, 

«**=•      Let  ylr(ai,a^,...;b,,h,,  ■■■;-) 

be  one  of  these  coefficients  which  is  not  identically  zero  and  which  is 
the  coefficient  of  a  term  in  which  at  least  one  of  the  c's  has  an  expo- 
nent greater  than  zero.  We  can,  now,  give  to  the  as^  5's,  •••  values 
which  Ave  will  denote  by  J.'s,  ^'s,  •••  such  that  neither  <f>  nor  yfr 
vanish  ;  and  consider  a  neigliborhood  iVof  the  point 

(-4 J,  A^^  ••• ;   B^^  B^,  ••• ;  •••) 
throughout  which 

(7)  (i>(a^,  a^,  .-.;   h^,  b^,  •••;  •••)=5tO, 

(8)  'f{a^,  ^2,  •••;  ^p  ^2,  •••;  "O^^O. 
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Let  us  now  restrict  the  a's,  5's,  •••to  the  neighborhood  iVand  ask 
ourselves  under  what  circumstances  we  can  have  </)j  =  0.  If  this 
equation  is  fulfilled,  we  see  from  (6)  that  <l>  vanishes  for  the  trans- 
formed loci,  while,  by  (7),  it  does  not  vanish  for  the  original  loci. 
Since,  by  hypothesis,  the  vanishing  of  (f)  gives  a  necessary  and 
sufficient  condition  for  a  projective  property,  a  transformation  (5) 
which  causes  </>  to  vanish  when  it  did  not  vanish  before  must  be  a 
singular  transformation.  That  is,  if  the  a's,  J's,  •••  are  in  the  neigh- 
borhood iV",  whenever  c^^  vanishes  the  determinant  c  of  (5)  vanishes. 
Moreover,  <^j  does  vanish  for  values  of  the  a's,  5's,  •••  in  JV",  for  if  we 
assign  to  the  a's,  5's,  •••  any  such  values,  </)^  becomes  a  polynomial  in 
the  c^/s,  which,  by  (8),  is  of  at  least  the  first  degree,  and  therefore 
vanishes  for  suitably  chosen  values  of  the  ^./s.  We  can  therefore 
apply  the  theorem  for  more  than  three  variables  analogous  to  Theorem 
8,  §  76,  and  infer  that  0^  is  a  factor  of  the  determinant  c  ;  and  conse- 
quently, since  this  determinant  is  irreducible  (Theorem  1,  §  61),  that 
^j  is  merely  a  constant  multiple  of  c. 

The  reasoning  we  have  just  applied  to  <^j  applies  equally  to  any 
of  the  factors  on  the  right  of  (6)  which  are  of  at  least  the  first  degree 
in  the  c^/s.     Accordingly  (6)  reduces  to  the  form 

(9)  (i>{a[,  a^,  •••  ;  h[,  h'^  •••;  •.•)  =  (?^x('^r  ^2^  ••'5  ^v  K  '"  5  •••)' 

where  %  no  longer  involves  the  Ci/s.  To  determine  this  polynomial 
%,  let  us  assign  to  the  Ci/s  the  values  0,  1  which  reduce  (5)  to  the 
identical  transformation.  Then  the  a^'s,  5''s,  •••reduce  to  the  a's, 
6's  ••• ,  while  c=l;  so  that  from  (9)  we  see  that 

Substituting  this  value  of  x  i^  C^)?  we  see  that  (f)  is  really  an  in- 
variant. 

In  order  to  avoid  all  misunderstanding,  we  state  here  explicitly 
that  if  we  have  two  or  more  polynomials,  </)j,  </>2,  •••  in  the  coefficients 
of  the  forms /*,  the  equations  (j)^  =  (l>^—  •  •  •  =  0  may  be  a  necessary  and 
sufficient  condition  for  a  projective  property  of  the  loci^-=0,  even 
though  <^j,  (f)^,  •••  are  not  invariants.  For  instance,  a  necessary  and 
sufficient  condition  that  the  two  lines 

d/-!  %C-t    ■^"  Oief  Oun  "x"  vtq  Jin  —   V, 
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coincide  is  the  vanishing  of  the  three  two-rowed  determinants  of 
the  matrix  /»/,/»- 

ti/'t  t^n  ^o 

^1     h     h 

none  of  which  is  an  invariant.  Or,  again,  a  necessary  and  sufficient 
condition  that  a  quadric  surface  break  up  into  two  planes,  distinct 
or  coincident,  is  the  vanishing  of  all  the  three-rowed  determinants 
of  its  matrix,  and  these  are  not  invariants.  In  this  case  we  can  also 
express  the  condition  in  question  by  the  identical  vanishing  of  a 
certain  contravariant,  namely,  the  adjoint  of  the  quadratic  form ; 
and  this  —  a  projective  relation  expressed  by  the  identical  vanish- 
ing of  a  covariant  or  contravariant  —  is  typical  of  what  we  shall 
usually  have  when  a  single  equation  </)  =  0  is  not  sufficient  to  express 
the  condition.  There  are,  however,  cases  where  the  condition  is  given 
by  the  vanishing  of  two  or  more  invariants;  cf.  Exercise  6,  §90. 


^EXERCISES 

1.  Prove  that  if  in  Theorem  1  our  system  consists  not  merely  of  the  ground- 
forms  (1)  but  also  of  certain  points 

and  we  have  not  an  invariant  /  but  a  covariant  of  weight  A,  and  of  degree  a  in  the 
a's,  jS  in  the  6's,  etc.,  rj  in  the  i/s,  ^  in  the  2's,  etc.,  then 

mia  +  1712/3  -{-•••=  nX.^  7]  -{-  ^  +  •••.. 

2.  Extend  Theorem  2  to  the  case  of  covariants.  Does  Theorem  3  admit  of 
such  extension  ? 

3.  Extend  Theorem  4  to  the  case  of  covariants. 

4.  Show  that  an  integral  rational  invariant  of  a  single  binary  form  of  odd 
degree  must  be  of  even  degree. 

5.  Show  that  the  weight  of  an  integral  rational  invariant  of  a  single  binary 
form  can  never  be  smaller  than  the  degree  of  the  form. 

6.  Express  the  condition  that  (a)  two  lines,  and  (b)  two  planes  coincide,  in 
the  form  of  the  identical  vanishing  of  a  covariant  or  contravariant. 

7.  Prove  that  a  polynomial  in  the  coefficients  of  a  system  of  n-ary  forms  which 
is  homogeneous  in  the  coefficients  of  each  form  taken  by  themselves,  and  which  is 
unchanged  when  the  forms  are  subjected  to  any  linear  transformation  of  determi- 
nant +  1,  is  an  invariant  of  the  system  of  forms. 

8.  Generalize  Exercise  7  to  the  case  of  covariants. 
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82.  Resultants  and  Discriminants  of  Binary  Forms.  If  we  inter- 
pret (rcj,  x^)  as  homogeneous  coordinates  in  one  dimension,  the  equa- 
tions obtained  by  equating  the  two  binary  forms 

(j>(x^,x^)  =  h^x^-^h^x^-''x^-\-  ^^-  -irh^x^ 

to  zero  represent  sets  of  points  on  a  line.  The  points  given  by  the 
equation /=  0  are  the  points  at  which  the  linear  factors  of  /  vanish, 
and  the  points  corresponding  to  </>  =  0  are  the  points  at  which  the 
linear  factors  of  <f>  vanish.  Since  two  binary  linear  forms  obviously 
vanish  at  the  same  point  when,  and  only  when,  these  linear  forms  are 
proportional,  it  follows  that  the  loci  of  the  two  equations /=  0,  </>  =  0 
have  a  point  in  common  when,  and  only  when,/  and  <^  have  a  common 
factor  other  than  a  constant.  Hence,  by  §  72,  a  necessary  and  suffi- 
cient condition  that  the  two  loci  /  =  0,  </>  =  0  have  a  point  in  common  is 
that  the  resultant  R  of  the  binary  forms  f  cj)  vanish. 

The  property  of  these  two  loci  having  a  point  in  common  is, 
however,  a  projective  property.     Thus,  by  Theorem  4,  §  81, 

Theorem  1.  The  resultant  of  two  binary  forms  is  a  homogeneous 
invariant  of  this  pair  of  forms. 

From  the  determinant  form  of  R  given  in  §  68  it  is  clear  that  R 
is  of  degree  m  in  the  a's  and  of  degree  n  in  the  5's.  Hence  by 
formula  (2),  §  81,  ^^  ^^^ 

Theorem  2.  The  weight  of  the  resultant  of  two  binary  forms  of 
degrees  m  and  n  is  mn. 

The  following  geometrical  problem  will  lead  us  to  an  important 
invariant  of  a  single  binary  form. 

Let  us  resolve  the  form  /,  which  we  assume  not  to  be  identically 
zero,  into  its  linear  factors  (cf.  formula  (4),  §  65), 

The  equation  /=  0  represents  n  distinct  points  provided  no  two  of 
these  linear  factors  are  proportional  to  each  other.  If,  however,  two 
of  these  factors  are  proportional,  we  say  that  /  has  a  multiple  linear 
factor,  and  in  this  case  two  or  more  of  the  n  points  represented  by 
the  equation  /=  0  coincide.  Let  us  inquire  under  what  conditions 
this  will  occur. 
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Form  the  partial  derivatives  : 

-    df 


(1) 


dx 


+  «2«^l-«i^2)(4'^l-«3^2)   •••  {^n^l-^n^2) 


df 


aU«l'^^l-«i^2)   •••   W-l^l-«n-1^2)- 

From  these  formulae  we  see  that  any  multiple  linear  factor  of 
/  is  a  factor  of  both  of  these  partial  derivatives. 

Conversely,  if  these  partial  derivatives  have  a  common  linear 
factor,  it  must  be  a  factor  of  /  on  account  of  the  formula, 

x^-^-\-x^—~=nf* 
o  x^  d  x^ 

a  formula  which  follows  imntediately  from  the  expressions, 

r  5/ 


(2) 


bx^ 
bx^ 


=  naQxl~^  +  {n  —  l)a^x\~'^x^  +   •  ••  +  dn-ix"^  S 
=  a^xl'^  +  2  a^xl-'^x^  +  •••  +  na^xlf^. 


But,  by  (1),  no  linear  factor  of  /  can  be  a  factor  of  ^f/dx-^  unless 
it  is  a  multiple  factor  of/.     Thus  we  have  proved 

Theorem  3.  A  necessary  and  sufficient  condition  that  f  have  a 
multiple  linear  factor  is  that  the  resultant  of  df/dx-^  and  df/bx^  vanish. 

Definition.  The  resultant  of  df/dx-^  and  bf/bx^  is  called  the  dis- 
criminant off. 

From  (2)  we  see  that  the  discriminant  of  /  may  be  written  as  a 
determinant  of  order  2  n  —  2  whose  elements,  so  far  as  they  are  not 
zero,  are  numerical  multiples  of  the  coefficients  a^,  a^,  •••  a^  of/". 
That  is,  this  discriminant  is  a  polynomial  in  the  «'s.  Moreover,  its 
vanishing  gives  a  necessary  and  sufficient  condition  that  the  locus 
/=Ohave  a  projective  property  (namely,  that  two  points  of  this 
locus  coincide).     Hence,  by  Theorem  4,  §  81,  this  discriminant  is  a 


*  This  is  merely  Euler's  Theorem  for  Homogeneous  Functions. 
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homogeneous  invariant,  whose  degree  and  weight  are  readily  deter- 
mined.    Thus  we  get  the  theorem  : 

Theorem  4.  The  discriminant  of  a  binary  form  of  the  nth 
degree  is  a  homogeneous  invariant  of  this  form  of  degree  2{n  —  1)  and 
of  weight  n(n  —  1). 

A  slight  modification  in  the  definition  of  the  discriminant  is  often 
desirable.  Let  us  write  the  binary  form  /,  not  in  the  above  form 
where  the  coefficients  are  a^,  a^,  •  •  •  a^-,  but,  by  the  introduction  of 
binomial  coefficients,  in  the  form 

f{x^,  x^)  =  a^xl  +  na^x^-^x^  +  ^    o~      ci'^^T'^^l  H h nan_^x^xl-^  +  a^xl. 

Then  we  may  write 

1  iZ  =  a^xl-^  +{n-  l)a^x^-^x^  +  (^-^X^-^)  a^r^a^l  + 
n  dx-^^  2 ! 

--^  =  a^x^-^  +  {n-  l)a^x^-^x^  +  (^  -  ^X^  -  ^)  ^^^^^-3^2  +  .. .  +  a^x'^'K 

n ^Xn  A  I 

We  may  then  define  the  discriminant  of  /  as  the  resultant  of  the  two 
binary  forms  just  written.  We  thus  get  for  the  discriminant  a 
polynomial  in  the  a's  which  differs  from  the  discriminant  as  above 
defined  only  by  a  numerical  factor,  and  for  which  Theorems  3  and  4 
obviously  still  hold.  If  this  last  definition  be  applied  to  the  case  of 
a  binary  quadratic  form,  it  will  be  seen  that  it  leads  us  precisely  to 
what  we  called  the  discriminant  of  this  quadratic  form  in  the  earlier 
chapters  of  this  book. 

EXERCISES 

1.  Prove  that  the  resultant  of  two  binary  forms  of  degrees  n  and  m  respec- 
tively is  irreducible. 

[Suggestion.  When  ^q  =  0,  Ji  is  equal  to  a^  times  the  resultant  of  two  binary 
forms  of  degrees  n  and  fti  —  1  respectively.  Show  that  if  this  last  resultant  is  irredu- 
cible, H  is  also  irreducible,  and  use  the  method  of  induction,  starting  with  the  case 

71=1,  W  =  l.] 

2.  Prove  by  the  methods  of  this  chapter  that  the  bordered  determinants  of 
Chapter  XII  are  invariants  of  weight  2. 

3.  The  following  account  of  Bezout's  method  of  elimination  is  sometimes 
given : 

If  /and  <^  are  polynomials  in  x  which  are  both  of  degree  n,  the  expression 
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vanishes,  independently  of  y,  for  a  value  of  x  for  which  both /and  ^  vanish,  and 
is  divisible  by  x  —  y,  since  it  is  zero  for  x  —  y.     Hence 


F{x,  y)  = 


f(x)cf>(y)  -  <f>ix)f(y) 
x-y 


is  a  polynomial  of  degree  (m  —  1)  in  a;  which  vanishes  for  all  values  of  y  when  x  is 
a  common  root  of/  and  <^.  Arranging  F  according  to  the  powers  of  y,  we  have 
the  expression 

+  2/(^10    +  C^^X  +  C^2^^  +    •"    +^1.  n-l^'^'O 

+  y^(^20+%^  +  ^22^^  +   •••   +  <^2,»^-l^''~0 
+ 

+  ?/"-i(c„_i,o+Cn-i,ia;  +  6-,i_i,  2^^  +  •••  +  c„_i,„_i:c"-i). 

If  this  function  is  to  vanish  independently  of  y,  the  coefficient  of  each  power 
of  y  must  be  zero.  This  gives  n  equations  between  which  we  can  eliminate  the  n 
quantities,  1  Xy  x^,  ••-  x'*-'^,  obtaining  the  resultant  in  the  form  of  the  determinant, 


R  = 


'00 


10 


^11- 


'^O,  w— 1 
^l,  n-1 


('n—l,Q('n  —  l,l  '**^n— 1,  n— 1 


:0. 


With  the  help  of  the  auxiliary  function  F  we  have,  in  this  case,  reduced  the 
resultant  to  a  determinant  of  the  nth  order,  while  that  obtained  by  the  method  of 
Sylvester  was  of  order  2  n. 

Criticise  this  treatment  and  make  it  rigorous,  applying  it,  in  particular,  to  the 
case  of  homogeneous  variables. 

4.  If /and  <fi  are  polynomials  in  (x,  y)  of  degrees  n  and  7n  respectively  and 
are  relatively  prime,  prove  that  the  curves/=  0,  ^  =  0  cannot  have  more  than  mn 
points  of  intersection. 

[Suggestion.  Show  first  that  the  coordinate  axes  can  be  turned  in  such  a  way  that 
no  two  points  of  intersection  have  the  same  abscissa,  and  that  the  equations  of  the  tvvo 
curves  are  of  degrees  n  and  m  respectively,  after  the  transformation,  in  y  alone.  Then 
eliminate  y  between  the  two  equations  by  Sylvester's  dyalitic  method.] 

5.  Prove  that  every  integral  rational  invariant  of  the  binary  cubic  is  a  con- 
stant multiple  of  a  power  of  the  discriminant. 

[Suggestion.  Show  that  if  the  discriminant  is  not  zero,  every  binary  cubic  can 
be  reduced  by  a  non-singular  linear  transformation  to  the  normal  form  x\  —  x\.  Then 
as  in  §48.] 


CHAPTER   XYIII 
SYMMETRIC    POLYNOMIALS 

83.   Fundamental  Conceptions.     S  and  S  Functions. 

Definition  1.  A  polynomial  F{x-^^  •••  x^)  is  said  to  he  symmetric 
if  it  is  unchanged  hy  any  interchange  of  the  variables  {x-^^  •  •  •  x^). 

It  is  not  necessary,  however,  to  consider  all  the  possible  permuta- 
tions of  the  variables  in  order  to  show  that  a  polynomial  is  sym- 
metric. If  we  can  show  that  it  is  unchanged  by  the  interchange  of 
every  pair  of  the  variables,  this  is  sufficient,  for  any  arrangement 
{Xfi^  rr^,  '"Xk)  may  be  obtained  from  {x^^  x^^  •••^n)  by  interchanging 
the  x^  in  pairs.  Thus,  if  «  9^  1,  interchange  x^  with  rr^;  then  in- 
terchange the  second  letter  in  the  arrangement  thus  obtained  with 
Xc^\  and  so  on.     Hence  we  have  the  following  theorem  : 

Theorem  1.  A  necessary  and  sufficient  condition  for  a  poly- 
nomial to  he  symmetric  is  that  it  he  uncha7iged  hy  every  interchange  of 
two  variables. 

A  special  class  of  symmetric  polynomials  of  much  importance  are 
the  2-f unctions,  defined  as  follows: 

Definition  2.  2  before  any  term  means  the  sum  of  this  term  and 
of  all  the  similar  ones  obtained  from  it  hy  interchanging  the  subscripts. 

Thus,  for  example, 

2jX~^  z^  X-^  -p  X2  ~i     ' '  •  ~r~  ^n^ 
V^^a^/S   —  ^<^^P  _L_  ^'^^^  4-   .  .  .  _J_    ^^^P 

+  xlx{  +  x^xl  H h  x^xl  (a  =?b  /3) 

+ 

+   /y.*.y.«   _J_    .  .  .    _1_   /r'^'r'* 
2    3  ^^  '^  '^2i    n 

+  .        .         .        . 

It  is  clearly  immaterial  in  what  order  the  exponents  a,  /3,  •••  are 
written.     Thus,  ^x'^x^xl  =  ^x^xlxl. 

240 
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If  we  consider  any  term  of  a  symmetric  polynomial,  it  is  evident 
that  the  polynomial  must  contain  all  the  terms  obtained  from  this 
one  by  interchanging  the  x's.  This  aggregate  of  terms  is  merely  a 
constant  multiple  of  one  of  the  S's  just  defined.  In  the  same  way  it 
is  clear  that  all  the  other  terms  of  the  symmetric  polynomial  must 
arrange  themselves  in  groups  each  of  which  is  a  constant  multiple 
of  a  2.     That  is, 

Theorem  2.  Every  symmetric  polynomial  is  a  linear  combination 
with  constant  coefficients  of  a  certain  number  of  2's. 

Among  these  S's  the  simplest  are  the  sums  of  powers  of  the  x's. 
For  the  sake  of  brevity  the  notation  is  used : 

Oy^  =  2jX^  ^  x-^  -\-  X2  -h  ' "  "t  ^71  \/c  =  1^  Zif  •"), 

It  is  sometimes  convenient  to  write  jS^^n. 

Theorem  3.  Any  symmetric  polynomial  in  the  x^s  can  be  ex- 
pressed as  a  polynomial  in  a  certain  number  of  the  iS^s. 

Since  every  symmetric  polynomial  is  a  linear  combination  of  a 
certain  number  of  S's,  in  order  to  prove  our  theorem  we  have  only 
to  show  that  every  E  can  be  expressed  as  a  polynomial  in  the  S's. 
Now  S!  —  ^'^  J-  ^'^  J-  ...  J-  'T* 

Sp  =  xl  +  x^2+  --■  +xl 
Hence,  if  a  ^  A  • 

S  S^  —  ^o.+P     I     /y.a+^  _1_    .  .  .    _!_  /r<*+^  _L_  ^a^^     I     /j.«/y.^     I      ... 

From  this  we  get  the  formula : 

(1)  2  xtxl  =  ^..S'^  -  S^^^  (a  ^  /3). 
If  a  =  /3,  we  have 

SI  =xf-\-xf-{-  '■'  +xi^  +  2x1x1+  2x1x1  +  '" 

=  S^,+ 2^x1x1. 
Hence 

(2)  2r»:r-  =  l(^2_^^^). 

Similarly,  by  multiplying  ^x\x\  by  aS^^,  we  get  the  following 
formulae  where  the  three  integers  a,  /3,  7  are  supposed  to  be  distinct: 

(3)  2  xl^xl  =  S^S,Sy  -  S^^^S,  -  ^.^.^S-,  -  S^^,  S^  +  2  S,^^^„ 

(4)  2  a;J44  ^  1  (jSIS,  -  ^2«  S,  -  2  <^.+,  S^  +  2  S^,^^), 

(5)  ^x'^x'.xl^liS'l-SS.A  +  iS,^). 
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The  proof  indicated  in  these  two  special  cases  may  be  extended 
to  the  general  case  as  follows  : 

If  we  multiply  together  the  two  symmetric  polynomials 

(6)  2 xlxl  ..^  xl,  S^  =  SrrJ,  {Jc  <  n) 

we  get  terms  of  various  sorts  which  are  readily  seen  to  be  all  con- 
tained in  one  or  the  other  of  the  following  polynomials,  each  of  these 
polynomials  being  actually  represented : 

/'7^        y  ^'^+^T^    ...   'T*     5"  ^a^^+A.    ...    ^K      V  ^a^^  ^K+A     V      a^(3  ^k^\ 

Consequently,  since  the  product  of  the  two  polynomials  (6)  is  sym- 
metric, it  must  have  the  form 

c^^x^-^^xl^  •••  xl  +  e^^x^xp-^  •••  irj-f -fc^S^Jo;^  ...  xl+^ 

where  <?j,  ••.  Cj^+^  are  positive  integers. 
Transposing,  we  may  write 


xl 


Cl 


—  c^x\x^^^  '"  x\— —  Cj^^x^x^  . . .  0:^+^] . 

Hence,  if  our  theorem  is  true  for  2a;J  ...  a;J,  it  is  also  true  for 
2a;J  •••  xl^y  But  we  know  it  is  true  for  k  =  l  (by  definition  of  the 
/S"s),  hence  it  is  true  for  Ar  =  2,  hence  for  A;  =  3,  and  so  on.  Thus 
our  theorem  is  completely  proved. 

84.  Elementary  Symmetric  Functions.  The  notation  ^x^x^  .  • «  < 
may  be  used  to  represent  ani/  S  in  n  variables.  If  y8=  7  =  ...  = 
z/  =  0,  this  becomes  ^x^  ov  S^;  if  7  =  ••.  =  i^  =  0,  it  becomes  2:r*a;| ; 
and  so  on. 

Let  us  now  consider  ^x^ x^  •••  x^  where  a,  fi,  "•  v,  are  all  0  or  1. 
The  following  n  cases  arise : 

a  =  l,  /3  =  7=...=z;  =  0,  Sa^j, 

«=y8  =  l,  7=...=i;  =  0,  ^X^X^j 

of  =  /9=  ...  =v=l,  x^x^  ...  rr^. 

The  extreme  case  a  =  /3  =  . .  •  =  z;  =  0  is  of  no  interest.  We  will 
represent  these  n  symmetric  polynomials  byjt?j,^2'  "'  Pw  respectively. 
They  are  called  the  elementary  symmetric  functions. 
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Theorem  1.  Any  symmetric  'polynomial  in  the  x'%  may  he  eay 
pressed  as  a  polynomial  in  the  p's. 

Since  any  symmetric  polynomial  in  the  x's  may  be  expressed  as  a 
polynomial  in  the  aS"s,  it  is  sufficient  to  show  that  every  aS' may  be 
expressed  as  a  pol3niomial  in  the  ^'s. 

Let  us  introduce  a  new  variable  x  and  consider  the  polynomial 

J  {^X  I'  X^<^  X(^^    ' '  •  Xn)  =■  \X        X^Jji^X        X^J  ' ' '  \X        Xf^j 

Using  the  factored  form  of/,  we  may  write 

T r  •••  -f- 


l/»t/  X         X-t         X  '~~  Xn  X  ~~~  Xin 

Since  /  vanishes  identically  when  x  =  Xi^  we  may  write 

f={x--  xf)  - p^x--^  -  rrf-i)  +  . . .. 
Accordingly, 

-^ —  =x^  '^  -\-(Xi—  p.)x'^  ^-\-(x^  —  p.Xi  H-  Po)x'^~^  -\-  "'<» 

X—  Xi 

^  =  nx""-^  +  (^1  -  np^)x^-'^  +  {S^  -  p^S^  +  np2)x''-^  -\ . 

CfX 

On  the  other  hand,  we  have 

-^  =  nx'^~'^  —  (n  —  l)p^x'^~^  +  (ti  —  2)jt?2^"~^  —  •  •  •. 
ux 

Hence,  equating  the  coefficients  of  like  powers  of  x  in  these  two 
expressions,  we  have 

S\-np^  =  -{n—l)p^, 


or 


(1) 


.  ^«-i  -  Pl'Sn-2  +F2>^n-s  -'••+(-  If  "^W^i?»-i  =  (  -  If'^Pn-v 

'S^-pA  +  2p^  =  0, 


.  ^»-l  -Pl^n-2  +P2^n-z  -•••+(-  '^y~\n  -  l)Pn-i  =  0. 

Now  consider  the  identities 

^"l - Pixr^  +  P2^?-^ -  -  4-(-l)>«  =  0         (1  =  1,  2,  ...  n). 
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Multiplying  these  identities  by  x^'^,  •••  a;^~"  respectively  and  adding 
the  results,  we  have 

Formulae  (1)  and  (2)  are  known  as  Newton'' 8  Formulce.  By 
means  of  them  we  can  compute  in  succession  the  values  of  /S'^,  aS'2,  ••• 
as  polynomials  in  the  ^'s : 


(3) 


Thus  our  theorem  is  proved. 

It  will  be  noted  that  Newton's  formuLie  (1)  cannot  be  obtained 
from  (2)  by  giving  to  k  values  less  than  n.  The  necessity  for  two 
different  sets  of  formulae  may,  however,  be  avoided  by  introducing 
the  notation  ^      =  ^->      =  ...  =zC\ 

Fn+l  — Pn+2  —  —  ^' 

Then  all  of  Newton's  formulae  may  be  included  in  the  following  form: 

(4)  S,-p,S,_,+  -+{-'^f-'P^-AH-'i-nPi.^(^     {k=l,2,-). 

Using  this  notation,  we  see  that  the  explicit  formulae  (3)  for 
expressing  the  S's  in  terms  of  the  jo's  are  wholly  independent  of  the 
number  n  of  the  a;'s. 

Since  the  formulae  referred  to  in  the  last  section  for  expressing  the 
2's  in  terms  of  the  aS"s  are  also  independent  of  n,  we  have  established 

Theorem  2.  If  we  introduce  the  notation  p^^^^  =/>«+2  =  •••  =  0  and 
use  Newton^s  Formulce  in  the  form  (4),  the  formula  for  expressing 
any  ^  as  a  polynomial  in  thep's  is  independent  of  the  number  n  of  the  x's. 

When  we  have  k  polynomials  in  n  variables 

we  say  that  there  exists  a  rational  relation  between  them  when,  and 
only  when,  a  polynomial  in  k  variables 

exists  which  is  not  identically  zero,  but  which  becomes  identically 
zero  as  a  polynomial  in  the  a:'s  when  each  z  is  replaced  by  the 
corresponding  /,  ^ (^f^^  "'fk)=0. 
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Theorem  3.  There  exists  no  rational  relation  between  the  elemen- 
tary symmetric  functions  in  n  variables  p^,  "•  Pn- 

For  let  F{zy,  '•-  Zn)  ^^  ^^7  polynomial  in  n  variables  which  is  not 
identically  zero,  and  let  (a^,  •••  a^  be  a  point  at  which  this  polynomial 
does  not  vanish.     Determine  {x-^^  •••  x^  as  the  roots  of  the  equation 

x""  -  a^x''-'^  +  a^x""-^  -•••+(-  !)"«„  =  0. 

For  these  values  of  the  x's^  the  ^'s  have  the  values  a^,  ••- a^,  and 
therefore  F{p-^^  '" Pn)  ^o^s  not  vanish  for  these  a;'s,  and  is  con- 
sequently not  identically  zero  as  a  polynomial  in  the  x^s.  Thus 
our  theorem  is  proved. 

Corollary.  There  is  only  one  way  in  which  a  symmetric  poly- 
nomial in  (ajj,  •  •  •  x^  can  be  expressed  as  a  polynomial  in  the  elementary 
symmetric  functions  p^^  •  •  •  jt?„. 

For  if/  is  a  symmetric  polynomial,  and  if  we  had  two  expressions 

^^'''^'  f{x,."-x,)^cl>,(p,,-^^p,), 

then  by  subtracting  these  identities  from  one  another  we  should  have 
as  an  identity  in  the  x's, 

4>i(Fv   ••'Pn)-<I>2(PV  '••Pn)  =  ^' 

This,  however,  would  give  us  a  rational  relation  between  the  p's, 
unless  <t>i{h^:.z„)  =  cl>,(z„-z^). 

Thus  we  see  that  the  two  expressions  for/ are  really  the  same. 

EXERCISES 

1.  Obtain  the  expressions  for  the  following  symmetric  polynomials  in  terms  of 
the  elementary  symmetric  functions  : 

2itX-^X2j  ^XyJC.^t  ^X\Xi^X^. 

2.  Prove  that  every  symmetric  polynomial  in  (xi,-"Xn)  can  be  expressed  in 
one,  and  only  one,  way  as  a  polynomial  in  S\,  •••  Sn- 

85.   The  Weights  and  Degrees  of  Symmetric  Polynomials.       We 

will  attach  to  each  of  the  elementary  symmetric  functions  p^  a  weight 
equal  to  its  subscript,  cf.  §  79. 

Theorem  1.  A  homogeneous  symmetric  polynomial  of  degree  m  in 
the  x's^  when  expressed  in  terms  of  the  p^s^  is  isobaric  of  weight  m. 


246  INTRODUCTION   TO   HIGHER   ALGEBRA 

Let 

be  such  a  polynomial.  Since  p-^  is  a  homogeneous  polynomial  of  the 
first  degree  in  the  x^^p^oi  the  second,  etc.,  any  term  of  <^,  when 
written  in  the  a:'s,  must  be  a  homogeneous  polynomial  of  degree  equal 
to  the  original  weight  of  the  term.  Thus,  for  example,  the  term 
6  PiP^Pz  whose  weight  is  13,  when  written  in  the  x^  will  be  a  homo- 
geneous polynomial  of  degree  13.  Accordingly  an  isobaric  group  of 
terms  when  expressed  in  terms  of  the  a;'s  will,  since  by  Theorem  3, 
§  84,  it  cannot  reduce  identically  to  zero,  be  homogeneous  of  the 
same  degree  as  its  original  weight.  If  then  <^  were  not  isobaric, 
/  would  not  be  homogeneous,  and  our  theorem  is  proved. 

Corollary.  If  f  is  non-homogeneous  and  of  the  mth  degree^  </>  is 
non-isoharic  and  of  weight  m. 

Theorem  2.  A  symmetric  polynomial  in  {x^^  •••  x^)-,  when  written 
in  terms  of  the  elementary  symmetric  functions  p-^^  •••  jt?^,  will  he  of  the 
same  degree  in  the  p's  as  it  was  at  first  in  any  one  of  the  x^s. 

Let/ be  the  symmetric  polynomial,  and  write 
/(xp  ^2,  •••  x^)=(t>(p^,  p^,  •••  p^), 

and  suppose /is  of  degree  m  in  x^  (and  therefore,  on  account  of  the 
symmetry,  in  any  one  of  the  a?'s),  and  that  cf)  is  of  degree  /jl  in  the  ^'s. 
We  wish  to  prove  that  m  =  fi.  Since  the  p's  are  of  the  first  degree 
in  a^j,  it  is  clear  that  m^fi. 

If  (f)  is  non-homogeneous,  we  can  break  it  up  into  the  sum  of  a 
number  of  homogeneous  polynomials  by  grouping  together  all  the 
terms  of  like  degree.  Each  of  these  homogeneous  polynomials  in 
the  p's  can  be  expressed  (by  substituting  for  the  p's  their  values  in 
terms  of  the  a;'s)  as  a  symmetric  polynomial  in  tlie  x's.  If  our  theo- 
rem were  established  in  the  case  in  which  the  polynomial  in  the  jo's 
is  homogeneous,  its  truth  in  the  general  case  would  then  follow  at 
once. 

Let  us  then  assume  that  <^  is  a  homogeneous  polynomial.  The 
theorem  is  obviously  true  when  ^  =  1,  since  then  p^=  —  x^.  It  will 
therefore  be  completely  proved  by  the  method  of  mathematical  in- 
duction if,  assuming  it  to  hold  when  the  number  of  x's  is  1,  2,  •••  ti— 1, 
we  can  prove  that  it  holds  when  the  number  of  a;'s  is  n. 
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For  this  purpose  let  us  first  assume  that  p^  is  not  a  factor  of  every 
term  of  <^.  Then  </>(2^i,  •••  Pn-v  ^)  ^^  ^^^  identically  zero  but  is  still 
a  homogeneous  polynomial  of  degree  /^  in  (j?^  "•  Pn-\)*  Now  let 
Xn  =  0.     This  makes  p^  =  0,  and  gives  the  identity 

(2)  /(^i,  •••^.-rO)^(/)(^;...K-i,  0), 

where  j^J,  -••  p^-^  are  the  elementary  symmetric  functions  of 
(a;^,  •••  x^_^),  and  /(a^^  •••  rz^^.j,  0)  is  a  symmetric  polynomial  of 
degree  m^  in  x^^  where  m^-^m.  From  the  assumption  that  our 
theorem  holds  when  the  number  of  a;'s  is  ti  —  1,  we  infer  from  (2) 
that  fi  =  m^-^m  ;  and  since  we  saw  above  that  /x  cannot  be  less  than 
m,  we  infer  that  ^  =  m^  as  was  to  be  proved. 

There  remains  merely  the  case  to  be  considered  in  which  p^  is  a 
factor  of  every  term  of  <^.  Let  p^  be  the  highest  power  of  j?^  which 
occurs  as  a  factor  in  (^.     Then 

where  (j)^  is  a  polynomial  of  degree  fx  —  k.  Putting  in  for  the  ^'s 
their  values  in  terms  of  the  a;'s,  we  get 

where 

(4)  /i (^1,  '"  x^)  =  <t>i (Pi,  . . .  p^). 

From  (3)  we  see  that  f^  is  of  degree  m  —  k  in  x^  and  from  (4),  since 

</)j  does  not  contain  p^  as  a  factor,  that  the  degrees  of  fi  in  a;^,  and 

of  01  in  the  p's  are  equal, 

m  —  k  =  /JL—  k. 

From  which  we  see  that  m  =  /x,  as  was  to  be  proved. 

The  two  theorems  of  this  section  are  not  only  of  theoretical 
importance,  they  may  also  be  put  to  the  direct  practical  use  of 
facilitating  the  computation  of  the  values  of  symmetric  polynomials 
in  terms  of  the  p's. 

In  order  to  illustrate  this,  let  us  consider  the  symmetric  function 

•/V^V   '"  X^j  =  2jX^X2X^. 

Since  /  is  homogeneous  of  the  fourth  degree  in  the  rr's,  it  will,  by 
Theorem  1,  be  isobaric  of  weight  4  in  the  p's.     Since  it  is  of  the 
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second  degree  in  x^,  it  will,  by  Theorem  2,  be  of  the  second  degree  in 
the  p's.     Hence 

(5)  ^^^2^3  =  Ap^Pz  +  -^PI  +  0^4' 

where  -4,  B^  and  C  are  independent  of  the  number  n  (Theorem  2, 
§  84),  and  may  be  determined  by  the  ordinary  method  of  unde- 
termined coefficients. 

Take  ?i  =  3,  so  that  p^  =  0.  Letting  x-^  =  0,  x^=x^  =  1,  we  have 
j0j  =  2,  jt?2  =  lii^3  =  ^'     Substituting  these  values  in  (5),  we  find  B  =  0. 

Letting  x^=  —1^  X2  =  x^=  1,  we  have p^  =  1^  p^  =  —  1^  p^  =  —  1, 
which  gives  A  =  l. 

Now  let  n  =  4:,  x^  =  X2=Xq  =  x^  =  1. 

From  this  we  find     p-^^  =  4,  p^  =  6,  p^  =  4,  jt?^  =  1. 
Substituting  this  in  (5)  gives  (7=  —4.     Hence 

I.xp^x^  =  p^p^-4p^. 


EXERCISES 

1.  The  symmetric  function 

f(xi,  •••  ar„)  =2:^1^2:^3  +  '^x'ixl  +  ^2:10:22:33:4 

is  homogeneous  of  the  fourth  degree  in  the  x's,  and  is  of  the  second  degree  in  xi ; 
hence,  when  written  in  terms  of  the  jo's,  it  will  have  the  same  form,  Apips  +  Bp^ 
+  Cp^,  as  the  above  example.     Compute  the  values  of  ^,  ^,  and  C. 

2.  If /(xi,  X2,  xs)  =  (xi  —  X2)^(^xi  —  Xs)\x2  —  x^)'^,  show  that 

f(xi,  X2,  X3)  =  -  27 pI  -  4 pi  +  18pij92i03  -  ^PiPz  +p\pl' 


86.   The  Resultant  and  the  Discriminant  of  Two  Polynomials  in 

One  Variable.     Let 

f{x)  =  x''  +  a^x""-^  +  ^22:"-^  +  •  •  •  +  a« 

=  (x—  a^{x  —  a^)  ••'  (x  —  a,^), 
(t){x)  =  x'^-{-h^x^-^-\-b^x'^-^-^  •••  +5^ 
=  {x-0,Xx-^^)-{x-0J, 

be  two  polynomials  in  x,  and  consider  the  product  of  the  mn  factors 


(1) 


(«2  -  ^l)("2  -^2)   '"  («2  -  ^m) 
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This  product  vanishes  when,  and  only  when,  at  least  one  of  the  «'s 
is  equal  to  one  of  the  fi's.  Its  vanishing  therefore  gives  a  necessary 
and  sufficient  condition  that  /  and  cfy  have  a  common  factor.  More- 
over, the  product  (1),  being  a  symmetrical  polynomial  in  the  as  and 
also  in  the  /8's,  can  be  expressed  as  a  polynomial  in  the  elementary 
symmetric  functions  of  the  as  and  /3's,  and  therefore  as  a  polynomial 
in  the  as  and  5's.  This  will  be  still  more  evident  if  we  notice  that 
the  product  (1)  may  be  written 

0(«i)</>(«2)  •••  <^(«n)- 

In  this  form  it  is  a  symmetric  polynomial  in  the  a's  whose  co- 
efficients are  polynomials  in  the  5's,  and  it  remains  merely  to  bring 
in  the  a's  in  place  of  the  a's. 

We  thus  see  that  the  product  (1)  may  be  expressed  as  a  poly- 
nomial F{a^^  "•  fin\  ^11  "'  ^m)  ii^  tlie  as  and  5's  whose  vanishing  gives 
a  necessary  and  sufficient  condition  that  /  and  </>  have  a  common 
factor.  In  §68  we  also  found  a  polynomial  in  the  a's  and  5's  whose 
vanishing  gives  a  necessary  and  sufficient  condition  that  /  and  ^ 
have  a  common  factor,  namely  the  resultant  M  of /and  <f). 

We  will  now  identify  these  two  polynomials  by  means  of  the 
following  theorem : 

Theorem  1.  The  product  (1)  differs  from  the  resultant  M  of  f 
and  (j)  only  by  a  constant  factor^  and  the  resultant  is  an  irreducible 
polynomial  in  the  a's  and  b's. 

In  order  to  prove  this  theorem  we  will  first  show  that  this  prod- 
uct (1),  which  we  will  call  F{a-^^  •••««;  ^v '"  ^m)^  is  irreducible. 
This  may  be  done  as  follows  :  Suppose  F  is  reducible,  and  let 
F(a^,  •••  a^  ;  5^,  .-  b^)=F^{a^,  ••.  a^\  5^,  .••  5^)  F^{a^,  »..  a^  ;  5^,  ...  5^), 
where  F^  and  F^  are  polynomials  neither  of  which  is  a  constant. 
Then,  since  the  a's  and  b's  are  symmetric  polynomials  in  the  a's  and 
yS's,  F^  and  F^^  may  be  expressed  as  symmetric  polynomials  <j)-^  and  <^2 
in  the  a's  and  yS's,  and  we  may  write 

<^i(«r  •••  ^«  ;  ^r  •••  Pm)  4>2i^v  •••«»;  ^v  "•  An) 

(«2  -  ^l)  (^2  -  ^2)"-  (S  -  ^m) 


^  (««  -  ^1)  {<^n  -   /^o)  •••  (%  -   Pm)' 


250  INTEODUCTION   TO   HIGHER  ALGEBRA 

The  factors  on  the  right-hand  side  of  this  identity  being  irreducible, 
we  see  that  <^j  must  be  composed  of  some  of  these  binomial  factors 
and  (^2  of  ^h®  others.  This,  however,  is  impossible,  since  neither  </>! 
nor  <^2  would  be  symmetric.     Hence  F  is  irreducible. 

Now,  since  F  =0  is  a  necessary  and  sufficient  condition  for  f{x) 
and  ^{x)  to  have  a  common  factor,  and  i2  =  0  is  the  same,  any  set  of 
values  of  the  a's  and  J's  w^hich make  F=  0  will  also  make  R=0.  Hence 
by  the  theorem  for  n  +  m  variables  analogous  to  Theorem  7,  §  76,  F  is 
a  factor  of  R.  Also,  since  jP  is  a  symmetric  polynomial  in  the  a's 
and  /3's  of  degree  m  in  each  of  the  as  and  n  in  each  of  the  /8's,  by 
Theorem  2,  §  85,  it  must  be  of  degree  m  in  the  a's  and  n  in  the  5's. 
But  R  is  of  degree  not  greater  than  m  in  the  a's  and  n  in  the  5's,  as  is 
at  once  obvious  from  a  glance  at  the  determinant  in  §  68.  Hence  F^ 
being  a  factor  of  R^  and  of  degree  not  lower  than  R^  can  differ  from 
it  only  by  a  constant  factor.     Thus  our  theorem  is  proved. 

Let  us  turn  now  to  the  question :  Under  what  conditions  does 
the  polynomial  f{x)  have  a  multiple  linear  factor  ?  Using  the  same 
notation  as  above,  we  see  that  the  vanishing  of  the  product 

K  -  «2)(«i  -  ^3)  •••  («i  -  ««)' 

is  a  necessary  and  sufficient  condition  for  this.  P  is  not  symmetric 
in  the  a's,  since  an  interchange  of  two  subscripts  changes  P  into  —P. 
If,  however,  we  consider  jP^  in  place  of  P,  we  have  a  symmetric 
polynominal,  which  can  therefore  be  expressed  as  a  polynomial  in 
the  a's. 

Moreover,  P=  0  is  also  a  necessary  and  sufficient  condition  that /(a?) 
have  a  multiple  linear  factor. 

On  the  other  hand,  it  is  easily  seen  that  f{x)  has  a  multiple  linear 
factor  when  and  only  when  f{x)  and  f'{x)  have  a  common  linear 
factor.  A  necessary  and  sufficient  condition  iov  f(x)  to  have  a  mul- 
tiple linear  factor  is  therefore  the  vanishing  of  the  resultant  of  f{x) 
and  f'(x).  This  resultant  we  will  call  the  discriminant  A  of  f{xy 
It  is  obviously  a  polynomial  in  the  coefficients  of/. 
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Theorem  2.  The  polynomials  F  and  A  differ  only  hy  a  constant 
factor^  and  are  irreducible. 

The  proof  of  this  theorem  is  similar  to  the  proof  of  Theorem  1, 
and  is  left  to  the  reader. 

EXERCISES 

1.  Compute  by  the  use  of  symmetric  functions  the  product  (1)  for  the  two 
polynomials  2  ,  , 

x^  +  hix  +  &2, 
and  compare  the  result  with  the  resultant  obtained  in  determinant  form. 

2.  Verify  Theorem  2  by  comparing  the  result  of  Exercise  2,  §  85,  with  the 
discriminant  in  determinant  form  of  the  polynomial 

x^  +  aix'^  +  Cl2,X  +  «3. 


CHAPTER   XIX 

POLYNOMIALS   SYMMETRIC   IN   PAIRS   OF   VARIABLES 

87.  Fundamental  Conceptions.  2  and  5  Functions.  The  variables 
(a^j,  •••  Xn)  which  we  used  in  the  last  chapter  may  be  regarded,  if  we 
wish,  not  as  the  coordinates  of  a  point  in  space  of  n  dimensions,  but 
rather  as  the  coordinates  of  n  points  on  a  line.  In  fact  this  is  the 
interpretation  which  is  naturally  suggested  to  us  by  the  ordinary 
applications  of  the  theory  of  symmetric  functions  (cf .  §  86).  Looked 
at  from  this  point  of  view,  it  is  natural  to  generalize  the  conception 
of  symmetric  functions  by  considering  n  points  in  a  plane, 

(1)  {^V  y\\  (^2'  ^2)'  •••  i^n^  Vn)^ 

Definition.     A  polynomial^ 

in  the  coordinates  of  the  points  (1)  is  said  to  he  a  symmetric  polynomial 
in  these  pairs  of  variables  if  it  is  unchanged  by  every  interchange  of 
these  pairs  of  variables. 

As  in  the  case  of  points  on  a  line,  we  see  that  it  is  not  necessary 
to  consider  all  the  possible  permutations  of  the  subscripts  in  order  to 
show  that  a  polynomial  F  is  symmetric.  It  is  sufficient  to  show  that 
F  is  unchanged  by  the  interchange  of  every  pair  of  the  points  (1). 

We  will  introduce  the  2  notation  here  precisely  as  in  the  case  of 
single  variables.     Thus,  for  example, 

2  xl^yl^  =  x\^y\-  +  x'^^yl^  +  •  •  •  +  x'^^yis 

2  xl^yl^x'^^yl-  =  x\^y{^xl^yl^  +  x\^y\^xl-yl^  +  •  •  -, 
and  so  on. 

As  in  the  case  of  single  variables,  it  is  clear  that  the  order  in 
which  the  pairs  of  exponents  a^,  ^^',  «2'  /^2>  •••  ^^®  written  is  imma- 
terial ;  and  also  that  every  symmetric  polynomial  in  the  pairs  of  variables 
(1)  is  a  linear  combination  of  a  certain  number  of  S  s. 

252 
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We  introduce  the  notation 

Theorem.  Any  symmetric  polynomial  F{x^^  y-^;  ••-  x^^  y^)  may 
he  expressed  as  a  polynomial  m  these  /S"s. 

The  proof  of  this  theorem  is  exactly  like  that  of  Theorem  3,  §  83, 
and  is  left  to  the  reader. 

88.   Elementary  Symmetric    Functions    of    Pairs    of    Variables. 

Every  2  function  of  n  pairs  of  variables  may,  by  giving  to  the  a's 
and  yS's  suitable  values,  be  written  in  the  form 

(1)  ^.xl^yl^x^^y^^  •••  x^-y^^n. 

Definition.  The  function  (1)  is  said  to  be  an  elementary  sym- 
metric function  of  the  pairs  of  variables  {x-^^  y^),  •••  (a^„,  y^)  when^  and 
only  when.  /i       a       -<  /  •      ^    ^  x 

hut  not  all  the  as  and  fi's  are  zero. 

We  shall  adopt  the  following  notation  for  these  elementary  sym- 
metric functions :       ^    =  ^  :r  r,    =^v 


It  is  clear  that  there  are  a  finite  number,  ^  n  {n-{-  3),  of  jt^^/s,  but 
an  infinite  number  of  S^^^. 

We  will  attach  to  each^  a  weight  with  regard  to  the  x''s  equal  to  its 
first  subscript  and  a  weight  tvith  regard  to  the  y's  equal  to  its  second 
subscript.  When  we  speak  simply  of  the  weight  of  p^j  we  will  mean 
its  total  weighty  that  is,  the  sum  of  its  subscripts. 

Theorem.  Any  symmetric  polynomial  F{x-^^  y^\  '•'  ^ni  yn)'^^y  ^^ 
expressed  as  a  polynomial  in  the  Pi/s. 

Since,  by  the  theorem  in  §  87,  any  such  polynomial  may  be 
expressed  as  a  polynomial  in  the  S^/s^  it  is  sufficient  to  show  that 
the  /S'y's  may  be  expressed  as  polynomials  in  the  Pi/s, 

^^        f J  =  \x^  +  fiy^,  I,  =  \t^  +  fiy^,  ...  f  „=  \x^  +  fxy„. 
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and  form  the  elementary  symmetric  functions  of  these  f's  : 

•^2  -  ^  f  1  ?2  =  ^(^^1  +  Wl)(^^2  +  /^^2) 

=  X2p20  +  Vi^ll  +  /^¥o2' 
•7^3  =  ^  ?1  ?2  ^3  =  ^^30  +  ^^/^i^21  +  ^^¥l2  +  /^¥o3' 

•^«  =  f  1  ?2  •  •  •  ?«  =  ^"Pno  +  ^"~Vi?n-i,  1  +  >^"~W«-2,  2  +   •  •  •  +  M''%«. 

Also  let  o-^  =  2|^f  (A:==l,  2,  •••) 

Let  a  and  jS  be  positive  integers,  or  zero,  but  not  both  zero. 

Then  <t^^^  =  Ef J+^  =  X«+^2:r«+^  +  X*^"^^- ^^2:^+^-1^^  +  ... 

But  by  Theorem  1,  §  84,  we  may  write 

where  ^  is  a  polynomial.     Hence 

where  '^  is  a  polynomial.  Regarding  this  as  an  identity  in  (X,  fi) 
and  equating  the  coefficients  of  the  terms  containing  X*/i^,  we  get  an 
identity  in  the  x's  and  y's, 

where  <l>  is  a  polynomial  in  the  p's.     Thus  our  theorem  is  proved. 

Theorem  3,  §  84,  does  not  hold  in  the  case  of  pairs  of  variables, 
as  relations  between  the  ^n(n  +  ^)  pi/s  do  exist;  for  example,  if 
n  =  2,  the  polynomial 

^P20P02  ~P2oPoi  '~Pl0P02  ~^  PloPllPol  ~' Pll 

vanishes  identically  when  the  |?'s  are  replaced  by  their  values  in 
terms  of  the  a;'s.     It  does  not  vanish  identically  when  ^  =  3. 

In  view  of  the  remark  just  made,  it  is  clear  that  the  represen- 
tations of  polynomials  in  pairs  of  variables  in  terms  of  the  jo^/s  will 
not  be  unique. 

For  further  information  concerning  the  subjects  treated  in  this 
section,  the  reader  may  consult  Netto's  Algebra^  Vol.  2,  p.  63. 
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EXERCISES 

1.  Prove  that  a  polynomial  symmetric  in  the  pairs  of  variables  (Xi^  y{)  and 
which  is  homogeneous  in  the  x's  alone  of  degree  n  and  in  the  ^'s  alone  of  degree  m 
can  be  expressed  as  a  polynomial  in  the  ^;y's  isobaric  of  weight  n  with  regard  to 
the  a:'s,  and  m  with  regard  to  the  ?/'s. 

2.  Express  the  symmetric  polynomial 

2  xly2ys 

in  terms  of  the  jo,/s  by  the  method  of  undetermined  coefficients,  mating  use   of 
Exercise  1. 

3.  A  polynomial  in  (xi,  ?/i,  2i;  x^,  ?/2,  22 ;  •••  x^,  ?/«,  z^)  which  is  unchanged  by 
every  interchange  of  the  subscripts  is  called  a  symmetric  polynomial  in  the  n 
points  {xi,  yi,  z^. 

Extend  the  results  of  this  section  and  the  last  to  polynomials  of  this  sort. 

89.  Binary  Symmetric  Functions.  The  pairs  of  variables  {x-^,  y-^, 
' ' '  {^ni  Vn)  ^^^y  ^^  regarded  as  the  homogeneous  coordinates  of  n 
points  on  a  line  as  well  as  the  non-homogeneous  coordinates  of  n 
points  in  a  plane.  It  will  then  be  natural  to  consider  only  sym- 
metric polynomials  which  are  homogeneous  in  each  pair  of  variables 
alone.  Such  polynomials  we  will  call  binary  symmetric  functions. 
Most  of  the  pijs  of  the  last  section  are  thus  excluded.  The  last 
n  +  1  of  them  {pn^j-^  Pn-\,v"  Pon)")  however,  are  homogeneous  of 
the  first  degree  in  each  pair  of  variables  alone.  We  will  call  them 
the  elementary  binary  symmetric  functions. 

Theorem  1.  ,Any  binary  symmetric  function  in  {xi'^yi'i  •••  ^ntyri) 
can  be  expressed  as  a  polynomial  in  (/>no'i^«-i,i'  "'  Pon)- 

If  we  break  up  our  binary  symmetric  function  into  S's,  it  is  clear 
that  each  of  these  S's  will  itself  be  a  binary  symmetric  function,  or, 
as  we  will  say  for  brevity,  a  binary  S.  It  is  therefore  sufficient  to 
prove  that  our  theorem  is  true  for  every  binary  E.  The  general 
binary  2  may  be  written 

^xl^y\^xl^yl^  •••  x^nyln         {a^^a^^  '"  ^a„), 

where,  if  we  denote  by  m  the  degree  of  this  E  in  any  one  of  the  pairs 
of  variables,  »  ^  .   n 

m  =  «!  -h  Pi  =  ^2  +  )^2  =  •*•  =^n  +  Pn' 

Let  us  assume  for  the  moment  that  none  of  the  y's  are  zero,  and  let 

XT  1      V  2  V  ™ 

-Ai  —  — ,  -Ao  =  —  ,  •••   ^\n  —  — • 
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Now  consider  the  elementary  symmetric  functions  of  these  X's; 


A 

=  i:Xi 

—  P\,n-\ 
Pon 

p. 

=  E  x^x,^ 

—  P2.n-2 

Pon 

•                •                • 

Pn 

=  -^1-^2   ' "  * 

^n 

•                •                • 

—  Pno 
Pon 

We  may  write 

(1)  ly^   2&, tJlsL  =  2 Xl'X^  -  X>  =  <I.(Pi,  ...  P„), 

where,  since  we  have  assumed  ai>a2=  "•  =«m  ^  is  a  polynomial 
of  degree  a^  in  the  P's  (Theorem  2,  §  85).     Hence  we  may  write 

(2)  ^(P  ,  •••  p\  —  ^^Po'^'Pi^^-v  '"  Pnoh 

Pih 
where  <^  is  a  homogeneous  polynomial  of  degree  a^ 
We  thus  get  from  (1)  and  (2) 

(3)  ^  ^^^l'   •  •  •   ^'^yy  =  Pon^iPon^  Pi,  n-v   ' '  *  Pno)^ 

an  equation  which  holds  except  when  one  of  the  ^'s  is  zero.  Since 
each  side  of  (3)  can  be  regarded  as  a  polynomial  in  the  x's  and  «/'s, 
we  infer,  by  Theorem  5,  §  2,  that  this  is  an  identity,  and  our  theorem 
is  proved. 

By  Theorem  1,  §  85,  <l>  is  isobaric  of  weight  a^-\-  a^-\-  •  •  •  +  «„  in 
the  P's.  Hence  2x\^y\^  •••  x'^^y^\  when  expressed  in  terms  of  these 
{n-{-l)pij^^  is  isobaric  of  weight  a^  +  «2  +  •"  +  «m  provided  we  count 
the  weight  of  the  ^^/s  with  regard  to  the  rr's.  Passing  back  now  to 
an  aggregate  of  a  number  of  such  S's,  we  get 

Theorem  2.  If  a  binary  symmetric  function  is  homogeneous  in 
.the  n  x^s  {or  y's)  of  degree  k^  it  will,  when  expressed  in  terms  of 
Pno-)  Pn-i,v  '"  Pon-i  ^^  isoharic  of  weight  k  with  regard  to  the  x^s  {or  y^s). 

We  have  seen  in  the  proof  of  Theorem  1  that  the  polynomial  <^  in 
(3)  is  a  homogeneous  polynomial  of  degree  a^  in  the  p's ;  so  that 
^^1**'^/^  •••  x^""  yj""  is  a  homogeneous  polynomial  of  degree  aj  +  ySj 
=  m  in  the  p's.     Hence 

Theorem  3.  Any  binary  symmetric  function  of  degree  m  in  each 
pair  of  variables  will,  when  written  in  terms  of  pnQ,pn-i,ii  '"Pon  ^^  ^ 
homogeneous  polynomial  of  degree  m  in  these  p's. 
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EXERCISES 

1.  Prove  that  no  rational  relation  exists  between  jOno,  •••  pom  and  hence  that  a 
binary  symmetric  function  can  be  expressed  as  a  polynomial  in  them  in  only  one 
way. 

2.  By  a  ternary  symmetric  function  is  meant  a  symmetric  polynomial  in  n 
points  (Xi,  yi,  zi)  which  is  homogeneous  in  the  coordinates  of  each  point. 

Extend  the  results  of  this  section  to  ternary  symmetric  functions.  Cf.  Exer- 
cise 3,  §  88. 

90.  Resultants  and  Discriminants  of  Binary  Forms.     It  is  the 

object  of  the  present  section  to  show  how  the  subject  of  the  re- 
sultants and  discriminants  of  binary  forms  may  be  approached  from 
the  point  of  view  of  symmetric  functions. 

J\p^\'!      2/  ^—      0*^1   ~i~  CI-IU/-I       OCn  ~x~   '**   "I     ^n^^ 

=  (^^2*^1  ^l'^2X^2*^l  ^2*^2/  "*  V^M'^l        ^ra'^2/' 

<I>{X^,  X^)  =  V?  +  h^T~^^2  +   •••   +  ^m^? 

=  (ffix,  -  ^[x^X^'^x,  -  P'^x^)  -  (/3;>i  -  yg^^a), 

be  two  binary  forms.  Each  of  these  polynomials  has  here  been 
written  first  in  the  unfactored  and  secondly  in  the  factored  form. 
By  a  comparison  of  these  two  forms  we  see  at  once  that  the  elemen- 
tary binary  symmetric  fractions  of  the  n  points 

(ccj,  a'l),  (a^,  al),  •••«,  <) 
are  a^,  —  a^,  a^,  •••  (  — l)"a„  ; 

and  of  the  m  points     (/SJ,  /5i'),(^2^  ^^2  )' ' ' '  (A.  I^'L) 
are  %,  _  5^,  63,  •••  (- If  5,,. 

Let  us  now  consider  the  two  linear  factors 

A  necessary  and  sufficient  condition  for  these  factors  to  be  propor- 
tional is  that  the  determinant 


vanish, 


P  = 


Let  us  form  the  product  of  all  such  determinants  : 
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The  vanishing  of  this  product  is  a  necessary  and  sufficient  con- 
dition that  at  least  one  of  the  linear  factors  of  /  be  proportional  to 
one  of  the  linear  factors  of  (j>,  that  is,  that/ and  </>  have  a  common 
factor  which  is  not  a  constant. 

We  may  obviously  reduce  P  to  the  simple  form 

In  this  form  it  appears  as  a  homogeneous  polynomial  of  the  mth 
degree  in  the  a's,  and  as  a  symmetric  polynomial  in  the  m  points 
{ffi^  P'i)'  Moreover,  it  is  obviously  a  binary  symmetric  function  which 
is  of  the  ^th  degree  in  the  coordinates  of  each  point.  Consequently, 
by  Theorem  3,  §  89,  it  can  be  expressed  as  a  homogeneous  polynomial 
of  the  nth.  degree  in  the  elementary  binary  symmetric  functions  of 
the  points  (/5J,  ySf),  that  is,  in  the  5's.  Thus  we  have  shown  that  the 
product  P  can  he  expressed  as  a  polynomial  in  the  a^s  and  5's  which  is 
homogeneous  in  the  as  of  degree  m  and  in  the  b^s  of  degree  n. 

In  §  72  we  found  another  polynomial  in  the  a's  and  5's,  whose 
vanishing  also  gives  a  necessary  and  sufficient  condition  for/ and  (/> 
to  have  a  common  factor,  namely,  the  resultant  i?.  We  will  now 
identify  these  polynomials  by  means  of  the  following  theorem  : 

Theorem  1.  The  product  P  differs  from  the  resultant  R  of  f  and 
<f>  only  by  a  constant  factor^  and  the  resultant  is  an  irreducible  poly- 
nomial in  the  a^s  and  Fs. 

We  may  show,  in  exactly  the  same  way  as  in  the  proof  of  Theo- 
rem 1,  §  86,  that  P,  when  expressed  as  a  polynomial  in  the  a's  and 
5's,  is  irreducible.  Since  P  =  0  and  R  =  0  each  give  a  necessary  and 
sufficient  condition  for  /and  <^  to  have  a  common  factor,  any  set  of 
values  of  the  a's  and  5's  which  make  P  —  0  will  also  make  R  =  0. 
Thus  by  Theorem  7,  §  76,  P  is  a  factor  of  R.  We  have  seen  that  P 
is  of  degree  m  in  the  a's  and  n  in  the  5's.  The  same  is  also  true  of 
R^  as  may  easily  be  seen  by  inspection  of  the  determinant  of  §  68. 
Hence,  P  being  a  factor  of  jR,  and  of  the  same  degree,  can  differ 
from  it  only  by  a  constant  factor.     Thus  our  theorem  is  proved. 

Let  us  now  inquire  under  what  conditions  the  binary  ioYmf(x^^  x^) 
has  a  multiple  linear  factor.  Using  the  same  notation  as  above,  we 
see  that  the  vanishing  of  the  product 

««^  -  ai<)««;  -  <40  -  («  -  a[a[D^ 


\^n-\^n        ^n-l^n  ) 


sPj («!,<;  •••<«;') 
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is  a  necessary  and  sufficient  condition  for  this.  P^  is  not  symmetric 
in  the  pairs  of  a's,  since  an  interchange  of  two  subscripts  changes  P^ 
into  —Pi-  If,  however,  we  consider  Pf  instead  of  P^,  we  have  a 
binary  symmetric  function  which  can  be  expressed  as  a  polynomial 
in  the  a's 

[Pi(«i,  <  ;    . .  •  «;,  0]2  =  P(«o,  . . .  a,). 

Moreover,  P  vanishes  when,  and  only  when,  P^  does.  Accordingly 
P—0  is  a  necessary  and  sufficient  condition  for  f{x^^  x^)  to  have  a 
multiple  linear  factor. 

But  the  vanishing  of  the  discriminant  A  (cf.  §  82)  of  f{x^^  x^) 
is  also  a  necessary  and  sufficient  condition  for  this, 

Theorem  2.  F  and  A  differ  only  by  a  constant  factor^  and  are 
irreducible. 

The  proof  of  this  theorem,  which  is  practically  the  same  as  that 
of  Theorem  1,  is  left  to  the  reader. 

If  we  subject  the  two  binary  forms  /  and  <^,  which  we  may  sup- 
pose written  in  the  factored  form,  to  the  linear  transformation 

{•^2,  —     21    1    '    ^22"^2' 

we  get  two  new  binary  forms 

(^1^1  ~  ^1^2)(^2*^l  ~  ^2^2)  '■'  \^n^\  ~~  -^71^2)' 

where  A'l  =      af  %  -  «-(?2i'  ^'j  =      l^j  ^11  -  ^'i^2V 

SO  that  ^fPj  -  A'.B^'  =  c{a'/^'j  -  a[^'!), 

where  c  is  the  determinant  of  the  transformation  (1). 

Since  the  linear  transformation  (1)  may  be  regarded  as  carrying 
over  thea's  and  /3's  into  the  A's  and  P's,  the  last  written  identity 
shows  us  that  aJ'/SJ-  —  aJ/3J'  is,  in  a  certain  sense,  an  invariant  of 
weight  1.  It  can,  however,  not  be  expressed  rationally  in  terms  of 
the  a's  and  5's.     Such  an  expression  is  called  an  irrational  invariant. 

Since  the  resultant  of  /  and  </>  is  the  product  of  mn  such  irra- 
tional invariants  of  weight  1,  it  is  evident  that  the  resultant  itself  is 
an  invariant  of  weight  mn.  Thus  we  get  a  new  proof  of  this  fact, 
independent  of  the  proof  given  in  §  82. 

A  similar  proof  can  be  used  in  the  case  of  the  discriminant  of  a 
binary  form. 
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EXERCISES 

Develop  the  theory  of  the  invariants  of  the  binary  biquadratic 
f{x^,  x^=ia^\  +  4  ayx\x^  +  6  a^\xl  +  4  a^x^xl  +  a^x\ 

=  {a'lx^  -  ajxg)  (al^'x^  -  a^x^)  (a'Jx^-a'^x^)  (a'^x^  -  a'^x^) 

along  the  following  lines : 

1.  Start  from  the  irrational  invariants  of  weight  2, 

A  =  (a'/a',  -  «[<)  (a>i  -  a^O, 
B  =  («['«;  -  a[a'i)(a'[a',  -  «;<), 

whose  sum  is  zero,  and  the  negatives  of  whose  ratios  are  the  cross-ratios  of  the 
four  points  «  <),  (<  <),  (a^,  a'^'),  (a[,  «;'). 

2.  Form  the  further  irrational  invariants  of  weight  2 

Ei=B-C,        E^  =  C-A,         Es=A-B', 

and  prove  that  every  homogeneous  symmetric  polynomial  in  Eu  Eo,  E3  is  a 
binary  symmetric  function  of  the  four  points  (a-,  a'i),  and  therefore  an  integral 
rational  invariant  of/. 

3.  In  particular 

G^^EiE^  +  E-zEs  -\-  E3E1,  Gz^EiEiEs 

are  homogeneous  integral  rational  invariants  of  weights  4  and  6,  and  of  degrees  2 
and  3  respectively.     Prove  that 

6^2= -365^2,        G^3=432^3, 
where  ^2 = ao  04  —  4  ai  as  -f  3  af^ 

^3=«oa2«4  +  2  aiGitts  —  aoal  —  a\a^  —  al- 
These  expressions  g2  and  g^  are  the  simplest  invariants  of/.* 

4.  Prove  that  the  discriminant  A  of /is  given  by  the  formula 

^=gl-21gl 

5.  If  A  =7^  0,  prove  that  ^3  =  0  is  a  necessary  and  sufficient  condition  that  the 
four  points  /=  0  form  a  harmonic  range ;  and  that  g2  =  ^  is  a  necessary  and 
sufficient  condition  that  they  form  an  equianharmonic  range.  (Cf.  Exercise  8, 
§33.) 

6.  Prove  that  ^2  =  ^3  =  0  is  a  necessary  and  sufficient  condition  that  /  have 
at  least  a  threefold  linear  factor.f 

*  They  are  among  the  oldest  examples  of  invariants,  having  been  found  by  Cayley 
and  Boole  in  1845. 

t  Notice  that  we  here  have  a  projective  property  of  the  locus  /  =  0  expressed  by 
the  vanishing  of  two  integral  rational  invariants  ;  cf.  the  closing  paragraph  of  §  81. 
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7.   If  A  is  the  absolute  irrational  invariant 

X-      ^ 


i.e.  one  of  the  cross-ratios  of  the  points  /=0,  prove  that  the  absolute  rational 

A 


invariant  „s 

J  —  — 


can  be  expressed  in  the  form 

4  (A^  -  A  +  ly 

27   (A-1)2A2  • 

8.  Prove  that  a  necessary  and  sufficient  condition  for  the  equivalence  of  two 
biquadratic  binary  forms  neither  of  whose  discriminants  is  zero  is  that  the  inva- 
riant /  have  the  same  value  for  the  two  forms. 

9.  Prove  that  a  necessary  and  sufficient  condition  for  the  equivalence  with 
regard  to  linear  transformations  with  determinant  +  1  of  two  biquadratic  binary 
forms  for  which  g2  and  g^  are  both  different  from  zero  is  that  the  values  of  ^2 
and  gs  be  the  same  for  one  form  as  for  the  other. 

10.  Prove  that  if  the  discriminant  of  a  biquadratic  binary  form  is  not  zero,  the 

form  can  be  reduced  by  means  of  a  linear  transformation  of  determinant  +  1  to  the 

normal  form  a    s  34 

4  XiX2>r-  g2XiX2  —  gsX2. 

11.  Prove  that  every  integral  rational  invariant  of  a  biquadratic  binary  form 
is  a  polynomial  in  ^2  and  gs. 


12.  Develop  the  theory  of  the  invariants  of  a  pair  of  binary  quadratic  forms 
along  the  same  lines  as  those  just  sketched  for  a  single  biquadratic  form. 

13.  Prove  that  every  integral  rational  invariant  of  a  pair  of  quadratic  forms 
in  n  variables  is  an  integral  rational  function  of  the  invariants  @o,  •••  @n  of  §  57. 

[  Suggestion.  Show  first  that,  provided  a  certain  integral  rational  function  of 
the  coefficients  of  the  quadratic  form  does  not  vanish,  there  exists  a  linear  transfor- 
mation of  determinant  +  1  which  reduces  the  pair  of  forms  to 

Pixl+^2XI+  •••    -^-^r^^l 

Then  show  that  every  integral  rational  invariant  of  the  pair  of  quadratic  forms  can  be 
expressed  as  a  binary  symmetric  function  of  (cti,  /3i),  (^2,  ^2),  •••  (««,  /Sn),  and  that 
the  ©'s  are  precisely  the  elementary  binary  symmetric  functions.] 


CHAPTER   XX 

ELEMENTARY  DIVISORS  AND   THE  EQUIVALENCE  OF 

X-MATRICES 

91.  X-Matrices  and  their  Elementary  Transformations.  The  theory 
of  elementary  divisors,  invented  by  Sylvester,  H.  J.  S.  Smith,  and, 
more  particularly,  Weierstrass,  and  perfected  in  important  respects 
by  Kronecker,  Frobenius,  and  others,  has,  in  the  form  in  which  we 
will  present  it,*  for  its  immediate  purpose  the  study  of  matrices 
(which  without  loss  of  "generality  we  assume  to  be  square)  whose 
elements  are  polynomials  in  a  single  variable  X.  Such  matrices  we 
will  call  X-matrices.f  The  determinant  of  a  X-matrix  is  a  polynomial 
in  X,  and  if  this  determinant  vanishes  identically,  we  will  call  the 
matrix  a  singular  X-matrix.  By  the  rank  of  a  X-matrix  we  under- 
stand the  order  of  the  largest  determinant  of  the  matrix  which  is 
not  identically  zero. 

We  have  occasion  here,  as  in  §  19,  to  consider  certain  elementary 
transformations  which  we  define  as  follows: 

Definition  1.  By  an  elementary  transformation  of  a  \-matrix 
we  understand  a  transformation  of  any  one  of  the  following  forms  : 

(a)  The  interchange  of  two  rows  or  of  tivo  columns. 

(h)  The  multiplication  of  each  element  of  a  row  {or  of  a  column)  hy 
the  same  constant  not  zero. 

(c)  The  addition  to  the  elements  of  a  row  (or  column)  of  the  products 
of  the  corresponding  elements  of  another  row  (or  column)  by  one  and 
the  same  polynomial  in  X. 

*  Various  modifications  of  the  point  of  view  here  adopted  are  possible  and  im- 
portant. First,  we  may  consider  matrices  whose  elements  are  polynomials  in  any  num- 
ber of  variables.  Secondly,  v/e  may  confine  ourselves  to  polynomials  whose  coefficients 
lie  in  a  certain  domain  of  rationality.  Thirdly,  we  may  approach  the  subject  from  the 
side  of  the  theory  of  numbers,  assuming  that  the  coefficients  of  the  polynomials  are 
integers.  The  simplest  case  here  would  be  that  in  which  the  elements  of  the  matrix 
are  themselves  integers  ;  see  Exercise  2,  §91,  Exercise  3,  §92,  and  Exercise  2,  §94. 

t  The  matrix  of  a  pencil  of  quadratic  forms  is  an  important  example  of  a  X-matri'^ 
to  which  the  general  theory  will  be  applied  in  Chapter  XXII. 
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If  we  pass  from  a  first  matrix  to  a  second  by  an  elementary  trans- 
formation, it  is  clear  that  we  can  pass  back  from  the  second  to  the 
first  by  an  elementary  transformation.  Thus  the  following  defini- 
tion is  justified: 

Definition  2.  Two  X-matrices  are  said  to  he  equivalent  if  it  is 
possible  to  pass  from  one  to  the  other  by  means  of  a  finite  number  of 
elementary  transformations. 

We  see  here  that  all  \-matrices  equivalent  to  a  given  matrix  are 
equivalent  to  each  other;  and,  as  in  §19,  that  two  equivalent  X-ma- 
trices  always  have  the  same  rank. 

The  rank  of  a  X-matrix  is  not,  however,  the  only  thing  which  is 
left  unchanged  by  every  elementary  transformation.  In  order  to 
show  this  we  begin  with 

Lemma  1.  If  the  polynomial  (f>(\)  is  a  factor  of  all  the  i-rowed 
determinants  of  a  \-matrix  a,  it  will  be  a  factor  of  all  the  i-rowed 
determinants  of  every  \-matrix^btained  from  a  by  means  of  an  elemen- 
tary transformation. 

If  the  transformation  is  of  the  type  (a)  or  (b)  of  Definition  1,  this 
lemma  is  obviously  true,  since  these  transformations  have  no  effect 
on  the  ^-rowed  determinants  of  a  except  to  multiply  them  by  con- 
stants which  are  not  zero.  If  it  is  of  the  type  ((?),  let  us  suppose  it 
consists  in  adding  to  the  elements  of  the  pth.  column  of  a  the  corre- 
sponding elements  of  the  qi\\  column,  each  multiplied  by  the  poly- 
nomial -v/r  (X).  Any  z-rowed  determinant  of  a  which  either  does  not 
involve  the  ^th  column,  or  involves  both  the  ^th  and  the  gth,  will 
be  unaffected  by  this  transformation.  An  z-rowed  determinant 
which  involves  the  pih  column  but  not  the  ^th  may  be  written  after 
the  transformation  in  the  form  A  ±  i/r(X)^,  where  A  and  B  are  z-rowed 
determinants  of  a;  so  that  here  also  our  lemma  is  true. 

Theorem  1.  If  Bl  and  b  are  equivalent  \-matrices  of  rank  r,  and 
Di{\)  is  the  greatest  common  divisor  of  the  i-rowed  determinants  {i'^r) 
of  a,  then  it  is  also  the  greatest  common  divisor  of  the  i-rowed  determi- 
nants of  b. 

For  by  our  lemma,  I>i(}C)  is  a  factor  of  all  the  ^-rowed  determi- 
nants of  b  ;  and  if  these  determinants  had  a  common  factor  of  higher 
degree,  this  factor  would,  by  our  lemma,  be  a  factor  of  all  the 
z-rowed  determinants  of  a;   which  is  contrary  to  hypothesis. 
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The  theorem  just  proved  shows  that  the  greatest  common  divisors 
i>j(X),  •••  Z>^(X)  are  invariants  with  regard  to  elementary  transfor- 
mations, or,  more  generally,  that  they  are  invariants  with  regard  to 
all  transformations  which  can  be  built  up  from  a  finite  number  of 
elementary  transformations.  In  point  of  fact  they  form,  along  with 
the  rank  r,  a  complete  system  of  invariants.  To  prove  this  we  now 
proceed  to  show  how,  by  means  of  elementary  transformations,  a 
X-matrix  may  be  reduced  to  a  very  simple  normal  form. 

Lemma  2.  If  the  first  element  *  /(X)  of  a  \-matrix  is  not  identically 
zero  and  is  not  a  factor  of  all  the  other  elements,  then  an  equivalent 
matrix  can  be  formed  whose  first  element  is  not  identically  zero  and  is  of 
lower  degree  than  f. 

Suppose  first  there  is  an  element  /^(X)  in  the  first  row  which  is 
not  divisible  by/(X)  and  lety  denote  the  number  of  the  column  in 
which  it  lies.  Dividing /^  by/  and  calling  the  quotient  q  and  the 
remainder  r,  we  have    y^(^>)  =  ^(x)/(X)  +  r(X). 

Accordingly,  if  to  the  elements  of  the/th  column  we  add  those  of  the 
first,  each  multiplied  by  —  ^(X),  we  get  an  equivalent  matrix  in  which  the 
first  element  of  the  yth  column  is  r(X),  which  is  a  polynomial  of  degree 
lower  than  /(X).  If  now  we  interchange  the  first  and  yth  columns, 
the  truth  of  our  lemma  is  established  in  the  case  we  are  considering. 

A  similar  proof  obviously  applies  if  there  is  an  element  in  the 
first  column  which  is  not  divisible  by/(X). 

Finally,  suppose  every  element  of  the  first  row  and  column  is 
divisible  by/(X),  but  that  there  is  an  element,  say  in  the  ^th  row  and 
yth  column,  which  is  not  divisible  by/(X).  Let  us  suppose  the  ele- 
ment in  the  first  row  and  yth  column  is  '</r(X)/(X),  and  form  an 
equivalent  matrix  by  adding  to  the  elements  of  the  yth  column 
—  '^(X)  times  the  corresponding  elements  of  the  first  column.  In 
this  matrix,  /(X)  still  stands  in  the  upper  left-hand  corner,  the  first 
element  of  the  yth  column  is  zero;  the  first  element  of  the  ^th  row 
has  not  been  changed  and  is  therefore  divisible  by  /(X) ;  and  the 
element  in  the  ith  row  and  yth  column  is  still  not  divisible  by  /. 
Now  form  another  equivalent  matrix  by  adding  to  the  elements  of 
the  first  column  the  corresponding  elements  of  the  yth  column.  The 
upper  left-hand  element  is  still  /(X),  while  the  first  element  of  the 

*  By  the  first  element  of  a  matrix  we  will  understand  the  element  in  the  upper  left- 
hand  corner. 
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iih  row  is  not  divisible  by  /(X).  This  matrix,  therefore,  comes  under 
the  case  already  treated  in  which  there  is  an  element  in  the  first 
column  which  is  not  divisible  by/(X),  and  our  lemma  is  established. 

Lemma  3.  If  we  have  a  \-matrix  whose  elements  are  not  all  iden- 
tically zero^  an  equivalent  matrix  can  he  formed  which  has  the  followiiig 
three  properties : 

{a)  The  first  element  f(X)  is  not  identically  zero. 

(b)  All  the  other  elements  of  the  first  row  and  of  the  first  column  are 
identically  zero. 

(c)  Every  element  neither  in  the  first  row  nor  in  the  first  column  is 
divisible  by  f{X). 

For  we  may  first,  by  an  interchange  of  rows  and  of  columns, 
bring  into  the  first  place  an  element  which  is  not  identically  zero. 
If  this  is  not  a  factor  of  all  the  other  elements,  we  can,  by  Lemma  2, 
find  an  equivalent  matrix  whose  first  element  is  of  lower  degree  and 
is  not  identically  zero.  If  this  element  is  not  a  factor  of  all  the 
others,  we  may  repeat  the  process.  Since  at  each  step  we  lower  the 
degree  of  the  first  element,  there  must,  after  a  finite  number  of  steps, 
come  a  point  where  the  process  stops,  that  is,  where  the  first  element 
is  a  factor  of  all  the  others.  We  can  then,  by  using  transformations  of 
type  (c)  (Definition  1),  reduce  all  the  elements  in  the  first  row  and  in 
the  first  column  except  this  first  one  to  zero,  while  the  other  elements 
remain  divisible  by  the  first  one.     Thus  our  lemma  is  established. 

Finally,  we  note  that  since  /(X)  in  the  lemma  just  proved  is  a 
factor  of  all  the  other  elements  of  the  simplified  matrix,  it  must,  by 
Theorem  1,  be  the  greatest  common  divisor  of  all  the  elements  of 
the  original  matrix. 

The  lemma  just  proved  tells  us  that  the  X-matrix  of  the  nth  order 
of  rank  r  >  0 


(1) 


a 


n 


m 


a 


n\ 


a. 


can  be  reduced  by  means  of  elementary  transformations  to  the  form 

/i(\)    0     ...       0 


(2) 


0  ^11        ••■        Kn-1 

"     ""-1,1     "■    ^n-l.»-l 
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where  /i(X)  ^  0  and  where  fi(\)  is  a  factor  of  all  the  5's.  The  last 
written  matrix  being  necessarily  of  rank  r,  the  matrix  of  the  (^— l)th 
order  i  i 

(3) 


^n-1,  ]     •••    ^n-l,n-i 


is   of   rank  r— 1.     Consequently,  if  r>l,  (3)  may   be   reduced   by 
means  of  elementary  transformations  to  the  form 

/^(X)    0       ...        0 


(4) 


0 


11 


'1,  n-2 


0 


C 


n-2,  1 


^n~ 


2,  n-2 


where  f^(\)  ^  0  and  where/2(X)  is  a  factor  of  all  the  es.  By  Theorem  1, 
/2(X),  being  the  greatest  common  divisor  of  all  the  elements  of  (4),  is  also  the 
greatest  common  divisor  of  all  the  i's,and  is  therefore  divisible  by/j(X). 
Now  it  is  important  to  notice  that  the  elementary  transformations 
which  carry  over  (3)  into  (4)  may  be  regarded  as  elementary  transforma- 
tions of  (2)  which  leave  the  first  row  and  column  of  this  matrix  unchanged. 
Thus  by  a  succession  of  elementary  transformations,  we  have  reduced 


(1)  to  the  form 


(5) 


/i(X)    0 

0      /2(^) 

0       0 


0 
0 

11 


0 

0 

'l,  n—2 


0        0 


'n-2,  1 


'n-2,  n-2 


where  neither  /^  nor  f^  vanishes  identically,  /^  is  a  factor  of  /g,  and 
f^  is  a  factor  of  all  the  cs. 

If  /•  >  2,  we  may  treat  the  (n  —  2)-rowed  matrix  of  the  c's,  which 
is  clearly  of  rank  r  —  2,  in  a  similar  manner.  Proceeding  in  this 
way,  we  finally  reduce  our  matrix  (1)  to  the  form 


(6) 


/i(^)     0     . 

0      f2iS)    ' 

-.00. 
••00. 

.  0 
■.  0 

0      0     . 

0       0     . 

"friS)    0     • 

..00. 

.  0 
•  0 

0        0 


0     0  ...  0 
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where  none  of  the/'s  is  identically  zero,  and  each  is  a  factor  of  the 
next  following  one. 

So  far  we  have  used  merely  elementary  transformations  of  the 
forms  (a)  and  (c),  Definition  1.  By  means  of  transformations  of  the 
form  (b)  we  can  simplify  (6)  still  further  by  reducing  the  coefficient 
of  the  highest  power  of  \  in  each  of  the  polynomials  fi(\)  to  unity. 
We  have  thus  proved  the  theorem : 

Theorem  2.  Every  X-matrix  of  the  nth  order  and  of  rank  r  can 
he  reduced  hy  elementary  transformations  to  the  normal  form 

U^{\)      0      ...      0     0  ...  0 
U^{\)  ...      0     0 


(7) 


0 


0 


0 


0 


U,{\)  0  ...  0 


0        0     ...      0     0  ...  0 

where  the  coefficient  of  the  highest  power  of  \  in  each  of  the  polynomials 
Ei(\)  is  unity,  and  -^^(X)  is  a  factor  of  E^^-^{X)for  2  =  1,  2,  ••.  r  —  1. 

By  Theorem  1,  the  greatest  common  divisor  of  the  2-rowed  determi- 
nants (i^  r)  of  the  original  matrix  is  the  same  as  the  greatest  common 
divisor  of  the  z-rowed  determinants  of  the  normal  form  (7)  to  which  it  is 
reduced.  These  last  mentioned  2- rowed  determinants  are,  however,  all 
identically  zero  except  those  which  are  the  product  of  i  of  the  E's.    Let 

(8)  ^,,(X)-E',^(\)  -  E,iS) 

be  any  one  of  these,  and  suppose  the  integers  k-^,  k^,  •••  k^  to  have 
been  arranged  in  order  of  increasing  magnitude.  We  obviously 
have  ^j  >  1,  ^^2  =  2»  •••  ^i  ^  ^*-  Consequently  E^  is  a  factor  of  Ej^ ,  E^ 
of  E,^,  etc.     Thus  El\)El\)  ...  E^X) 

is  seen  to  be  a  factor  of  (8),  and,  being  itself  one  of  the  z-rowed  de- 
terminants of  (7),  it  is  their  greatest  common  divisor.     That  is, 

Theorem  3.  The  greatest  common  divisor  of  the  i-rowed  determi- 
nants of  a  X-matrix  of  rank  r,  when  i  ^  r,  is 

I),(\)  =  Elx)E4\)  ■■■  E,{\), 

where  the  E^s  are  the  elements  of  the  normal  form  (7)  to  which  the  given 
matrix  is  equivalent. 

It  may  be  noticed  that  this  greatest  common  divisor  is  so  deter- 
mined that  the  coefficient  of  the  highest  power  of  X  in  it  is  unity. 
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We  come  now  to  the  fundamental  theorem: 

Theorem  4.  A  necessary  and  sufficient  condition  for  the  equiva- 
lence of  two  X-matrices  of  the  nth  order  is  that  they  have  the  same  rank 
r,  and  that  for  every  value  of  i  from  1  to  r  inclusive^  the  i-rowed  deter- 
minants of  one  matrix  have  the  same  greatest  common  divisor  as  the 
i-rowed  determinants  of  the  other. 

To  say  that  this  is  a  necessary  condition  is  merely  to  restate 
Theorem  1.  To  prove  it  sufficient,  suppose  both  matrices  to  be 
reduced  to  the  normal  form  (7),  where  we  will  distinguish  the 
normal  form  for  the  second  matrix  by  attaching  accents  to  the 
^'s  in  it.  If  the  conditions  of  our  theorem  are  fulfilled,  we  have,  by 
Theorem  3,  ^r^^^^  ^  :e,{\), 


and,  since  none  of  these  ^'s  are  identically  zero,  it  follows  that 

U'J^X)  =  nix)  (;=1,  2,  ...  r). 

Thus  the  normal  forms  to  which  the  two  X-matrices  can  be  reduced 
are  identical,  and  hence  the  matrices  are  equivalent,  since  two 
X-matrices  equivalent  to  a  third  are  equivalent  to  each  other. 
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1.   Reduce  the  matrix 


X 

1 

0 

0 

0 

0 

X 

0 

0 

0 

0 

0 

X 

0 

0 

0 

0 

0 

X-1 

0 

0 

0 

0 

0 

x-1 

by  means  of  elementary  transformations  to  the  normal  form  of  Theorem  2. 

Verify  the  result  by  finding  the  greatest  common  divisors  A(A.)  first  directly, 
and  secondly  from  the  normal  form. 

2.  By  an  elementary  transformation  of  a  matrix  all  of  whose  elements  are 
integers  is  understood  a  transformation  of  any  one  of  the  following  forms : 

(a)  The  interchange  of  two  rows  or  of  two  columns. 

(&)    The  change  of  sign  of  all  the  elements  of  any  row  or  column. 

(c)  The  addition  to  the  elements  of  one  row  (or  column)  of  the  products  of  the 
corresponding  elements  of  another  row  (or  column)  by  one  and  the  same  integer. 

Starting  from  this  definition,  develop  the  theory  of  matrices  whose  elements  are 
integers  along  the  same  lines  as  the  theory  of  X-matrices  was  developed  in  this  section. 
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92.  Invariant  Factors  and  Elementary  Divisors.  In  place  of  the 
invariants  Di(X)  of  the  last  section,  it  is,  for  most  purposes,  more 
convenient  to  introduce  certain  other  invariants  to  which  we  will 
give  the  technical  name  invariant  factors.  As  a  basis  for  the  defini- 
tion of  these  invariants  we  state  the  following  theorem,  which  is 
merely  an  immediate  consequence  of  Theorem  3,  §  91: 

Theorem  1.  The  greatest  common  divisor  of  the  i-rowed  determi- 
nants (^  =  2,  3,  •••  r)  of  a  \-matrix  of  rank  r  is  divisible  by  the  greatest 
common  divisor  of  the  (i—  l)-rowed  determinants  of  this  matrix. 

Definition  1.     If  &.  is  a  \-matrix  of  rank  r,  and 

Di\)  (»  =  1,  2,  ...  r) 

the  greatest  common  divisor  of  its  i-rowed  determinants  so  determined 
that  the  coefficient  of  the  highest  power  of  X  is  unity;  and  if  Dq(\)^1; 
then  the  polynomial 

(1)  UlX)^^^^  {i=l,2,...r) 

is  called  the  ith  invariant  factor  of  a,. 

This  definition  shows  that  these  ^'s  are  really  invariants  since 
they  are  completely  determined  by  the  i>'s  which  we  proved  to  be 
invariants  in  §  91.  Moreover,  by  multiplying  together  the  first  i  of 
the  relations  (1),  we  get  the  formula 

(2)  Dl\)  =  E,{X)El\)  -  Ei\)  (1  =  1,2,  ...  r). 

This  shows  us  that  the  -E^'s  completely  determine  the  i>'s,  and  since 
these  latter  were  seen  in  §  91  to  form,  together  with  the  rank,  a 
complete  system  of  invariants,  the  same  is  true  of  the  ^'s.     That  is. 

Theorem  2.  A  necessary  and  sufficient  condition  that  two  \-ma- 
trices  be  equivalent  is  that  they  have  the  same  rank  r,  and  that  the  inva- 
riant factors  of  one  be  identical  respectively  with  the  corresponding 
invariant  factors  of  the  other. 

Since,  in  the  case  of  a  non-singular  matrix  of  the  nth.  order, 
J)J\)  differs  from  the  determinant  of  the  matrix  only  by  a  constant 
factor,  we  see  that  in  this  case  the  determinant  of  the  matrix  is, 
except  for  a  constant  factor,  precisely  the  product  of  all  the  invari- 
ant factors.  This  is  the  case  which  is  of  by  far  the  greatest  impor- 
tance, and  the  term  invariant  factor  comes  from  the  fact  that  the  ^'s 
are  really  factors  of  the  determinant  of  the  matrix  in  this  case. 
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A  reference  to  Theorem  3,  §  91,  shows  that  our  invariant  factors 
are  precisely  the  polynomials  U^  which  occur  in  the  normal  form  of 
Theorem  2,  §  91 ;  and,  since  in  that  normal  form  each  ^  is  a  factor 
of  the  next  following  one,  we  have  the  important  result, 

Theorem  3.  If  E-^{\)^  ...  EJ(K)  are  the  successive  invariant  fac- 
tors of  a  \-matrix  of  rank  r,  then  each  of  these  Ws  is  a  factor  of  the  next 
following  one. 

This  theorem  enables  us  to  arrange  the  invariant  factors  of  a  X-ma- 
trix  in  the  proper  order  by  simply  arranging  them  in  the  order  of 
increasing  degree,  two  JS^'s  of  the  same  degree  being  necessarily 
identical. 

The  invariant  factors  (like  the  i>'s  of  the  last  section)  may  be 
spoken  of  as  rational  invariants  of  our  X-matrix  since  they  are  formed 
from  the  elements  of  the  X-matrix  by  purely  rational  processes, 
namely  the  elementary  transformations  of  §  91,  which  involve  only 
the  rational  operations  of  addition,  subtraction,  multiplication,  and 
divison.  In  distinction  to  these  the  elementary  divisors,  first  intro- 
duced by  Weierstrass,  are,  in  general,  irrational  invariants.*  These 
we  now  proceed  to  define. 

Definition    2.     If  a.  is  a  \-matrix  of  rank  r,  and  DX^)  is  the 

greatest  common  divisor  of  the  r-rowed  determinants  of  a,  then  the  linear 

factors  ^  ^        r   ^        ff 

*^  X— ct,  X  — ct  ,  X  — a  %    ••• 

of  DJi\)  are  called  the  linear  factors  of  ei.j- 

Since,  by  formula  (2),  i>^.(X)  is  the  product  of  all  the  invariant 
factors  of  a,  it  is  clear  that  each  invariant  factor  is  merely  the  prod- 
uct of  certain  integral  powers,  positive  or  zero,  of  the  linear  factors 
of  a.     We  may  therefore  lay  down  the  following  definition : 

*  German  writers,  following  Erobenius,  use  the  term  elementary  divisor  to  cover 
both  kinds  of  invariants.  This  is  somewhat  confusing,  and  necessitates  the  use  of 
modifying  adjectives  such  as  simple  elementary  divisors  for  the  elementary  divisors 
as  originally  defined  by  Weierstrass,  composite  elementary  divisors  for  the  ^'s.  On 
the  other  hand  Bromwich  {Quadratic  Forms  and  their  Classification  by  Means  of 
Invariant-factors^  Cambridge,  England,  1906)  proposes  to  substitute  the  term  invari- 
ant factor  for  the  term  elementary  divisor.  Inasmuch  as  this  latter  term  is  wholly 
appropriate,  it  seems  clear  that  it  should  be  retained  in  English  as  well  as  in  German 
in  the  sense  in  which  Weierstrass  first  used  it. 

t  It  will  be  noticed  that  if  a  is  non-singular,  the  linear  factors  of  a  are  simply  the 
linear  factors  of  the  determinant  of  a. 
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Definition  3.     Let  a.  be  a  \-matrix  of  rank  r,  and 

X  — a,  X— a',  \  —  a^\    ••• 

its  distinct  linear  factors.      Then  if 

El\)={\-ayi{\-a')%\-a"Yi    ...         (^  =  l,  2,   ...  r), 

are  the  invariant  factors  of  a.,  such  of  the  factors 

{\-a)S  (X-«XS        (X-a)«r, 

(\-t.')^i,        (\-a'yL,     (\-a'y;, 

(\-a"yi\         l\-a"Y^\    (^x-a"yr\ 

as  are  not  mere  constants  are  called  the  elementary  divisors  of  a,  each 
elementary  divisor  being  said  to  correspond  to  the  linear  factor  of  which 
it  is  a  power  * 

Since  the  invariant  factors  completely  determine  the  elementary- 
divisors  and  vice  versa^  it  is  clear  that  the  elementary  divisors  are  not 
merely  invariants,  but  that,  together  v^^ith  the  rank,  they  form  a 
complete  system  of  invariants.     That  is, 

Theorem  4.  A  necessary  and  sufficient  condition  that  two  X-ma- 
trices  be  equivalent  is  that  they  have  the  same  rank  and  that  the  elemen- 
tary divisors  of  one  be  identical  respectively  with  the  corresponding 
elementary  divisors  of  the  other. 

By  means  of  Theorem  3  we  infer  the  important  result : 
Theorem  5.      The  degrees  e^  of  the  elementary  divisors  correspond- 
ing to  any  particular  linear  factor  satisfy  the  inequalities 

^i^^i-i  (^'=2,  3,  ...  r). 

By  means  of  this  theorem  vre  can  arrange  the  elementary  divisors 
corresponding  to  any  given  linear  factor  in  the  proper  order  by  simply 
noticing  their  degrees. 

*  It  will  be  seen  that  the  definition  just  given  is  equivalent  to  the  following  one,  in 
which  the  conception  of  invariant  factors  is  not  introduced  : 

Definition.  Let  \  —  a  he  a  linear  factor  of  the  \-matrix  a  of  rank  r,  and  let  h  he 
the  exponent  of  the  highest  power  of\—a  ivhich  is  a  factor  of  all  the  i-rowed  determi- 
nants (i'^r)  of  a.  If  the  integers  ei  (which  are  necessarily  positive  or  zero)  are  defined 
hy  the  formula  et^h-h-i  (i  =  l,  2,   ...  r), 

then  such  of  the  expressions    /^_^yl   C\—aY^    .     CX  —  aY^ 

as  are  not  constants  are  called  the  elementary  divisors  of  a  which  correspond  to  the 
linear  factor  \—a. 
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EXERCISES 

1.  If  ^  =  0  and  \f/  =  0  are  two  conies  of  which  the  second  is  non-singular, 
show  how  the  number  and  kind  of  singular  conies  contained  in  the  pencil 
(f)  —  Xif/  =  0  depends  on  the  nature  of  the  elementary  divisors  of  the  matrix  of  the 
quadratic  form  cf>  —  Ai//. 

2.  Extend  Exercise  1  to  the  case  of  three  dimensions. 

3.  Apply  the  considerations  of  this  section  to  matrices  whose  elements  are 
integers.     (Cf.  Exercise  2,  §91). 

93.  The  Practical  Determination  of  Invariant  Factors  and  Elemen- 
tary Divisors.  The  easiest  general  method  for  determining  the 
invariant  factors  of  a  particular  X-matrix  is  to  reduce  it  by  means  of 
elementary  transformations  to  the  normal  form  of  Theorem  2,  §  91, 
following  out  step  by  step  the  reduction  used  in  the  proof  of  that  theo- 
rem. From  this  normal  form  the  invariant  factors  may  be  read  off ; 
and  from  these  the  elementary  divisors  may  be  computed,  although  only, 
in  general,  by  the  solution  of  equations  of  more  or  less  high  degree. 

There  are,  however,  many  cases  of  great  importance  in  which  the 
elementary  divisors  may  more  easily  be  obtained  by  other  methods. 
The  most  obvious  of  these  is  to  apply  the  definition  of  elementary 
divisors  directly  to  the  case  in  hand.  As  an  illustration,  we  mention  a 
matrix  of  the  nth  order  which  has  a  — X  as  the  element  in  each  place 
of  the' principal  diagonal,  while  all  the  other  elements  are  zero  except 
those  which  lie  immediately  to  the  right  of  or  above  the  elements  of 
the  principal  diagonal,  these  being  all  constants  different  from  zero : 


(1) 


a  —  \ 


0 


0      a  —  \Cc 


0      0 
0     0 


0   a-X 


(c^c^  •••<?n-i^O). 


The  determinant  of   this  matrix  is  (a  —  Xf.     The  determinant 
obtained   by  striking   out   the   first   column   and   the   last   row   is 
Cn-i'     Accordingly 


^1^2 


'»— 1* 


Z),(X)  =  (X-«)^  i>^i(X)  =  l,  ElX)  =  {\-a)\ 


Thus  we  see  that  (X  —  a^  is   the  only   elementary  divisor  of   this 
matrix,  while  the  invariant  factors  are  (X  —  aj^  and  n  —  1  I's. 
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This  direct  method  may  sometimes  be  employed  to  advantage  in 
conjunction  with  the  method  of  reduction  by  elementary  transfor- 
mations.    Cf .  Exercise  1  at  the  end  of  this  section. 

A  further  means  of  recognizing  the  elementary  divisors  in  some 
special  cases  is  furnished  by  the  following  theorems  whose  proofs, 
which  present  no  difficulty,  we  leave  to  the  reader: 

Theorem  1.  If  all  the  elements  of  a  X-matrix  are  zeros  except 
those  in  the  principal  diagonal^  and  if  each  element  of  this  diagonal 
which  is  not  a  constant  is  resolved  into  the  product  of  a  constant  hy  powers 
of  distinct  linear  factors  of  the  form  \  —  «,  \—  a\  •••,  then  these  powers 
of  linear  factors  will  he  precisely  the  elementary  divisors  of  the  matrix. 

Theorem  2.  If  all  the  elements  of  a.  \-matrix  are  zeros  except 
those  which  lie  in  a  certain  number  of  non-overlapping  principal  minors^ 
then  the  elementary  divisors  of  the  matrix  may  he  found  hy  talcing  the 
elementary  divisors  of  all  these  principal  minors. 

The  proof  of  this  theorem  consists  in  reducing  the  given  matrix 
to  the  form  referred  to  in  Theorem  1  by  means  of  elementary  trans- 
formations each  of  which  may  be  regarded  as  an  elementary  trans- 
formation of  one  of  the  principal  minors  in  question. 

It  should  be  noticed  that  this  theorem  would  not  be  true  if  the 
words  invariant  factors  were  substituted  in  it  for  elementary  divisors ; 
cf.  Exercise  3  below.  The  invariant  factors  may,  however,  be  com- 
puted from  the  elementary  divisors  when  these  have  been  found. 


1.   Prove  that  the  matrix 


EXERCISES 


is  equivalent  to 


X-a 

0 
0 

0 

X  —  a 

0 

1    o  o 
ft 

'    -  1            0            0 
0-1            0 
0            0-1 

0 
0 

1 

P' 
0 

0 

1 

x-a         0            0 
0       X-a         0 
0            0       X-a 

-  1 

0 

0 

0 

0 

0 

0 

- 1 

0 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

0 

0 

(X- 

a)2+^2 

1 

0 

0 

0 

0 

0                (X- 

a)2  +  ^2 

1 

0 

0 

0 

0 

0               (X- 

-  a)2  +  /?2 
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and  hence  that  its  elementary  divisors  are 

[X-  (a  +  pi)y,  i\-(a-l3i)y. 

2.  Generalize  Exei'cise  1  to  matrices  of  order  2n. 

3.  Find    (a)   the  elementary  divisors,  and  (h)  the  invariant   factors  of  the 

x\x-iy        0 

X(X  -  ly 


matrix 


0 
0 

X-  1 
0 


0 
0 
0 
X 


4.   Determine  the  invariant  factors  and  the  elementary  divisors  of  the  matrix 


2A. 
4A 

0 

X-  1 

3(A  -  1) 


3 

3(X  +  2) 
6X 
0 
1-  A 


0 

0 
X 
X-1 
2(A-1) 


1 
X  +  2 

2X 
0 
0 


X 
2X 

0 
0 
0 


Is  this  matrix  equivalent  to  the  matrix  in  the  exercise  at  the  end  of  §  91  ? 

5.   Devise  a  convenient  rational  process  for  computing  the  invariant  factors  of 
matrices  of  the  kinds  considered  in  Theorems  1  and  2. 

94.  A  Second  Definition  of  the  Equivalence  of  X-Matrices.      The 

definition  of  equivalence  of  X-matriees  which  we  have  used  so  far 
rests  on  the  elementary  transformations.  These  transformations  are 
of  such  a  special  character  that  this  definition  is  not  convenient  for 
most  purposes.  We  now  give  a  new  definition  which  we  will  prove 
to  be  coextensive  with  the  old  one. 

Definition.  Two  n-roiued  \-matrices  a  and  b  are  said  to  he  equiv- 
alent if  there  exist  two  non-singular  n-rowed  X-matrices  c  and  d  whose 
determinants  are  independent  of  X,  and  such  that 

(1)  b  =  cad.* 

Since  the  matrices  c  and  d  have,  by  hypothesis,  constant  determi- 
nants, the  inverse  matrices  c~^  and  d~^  will  also  be  X-matrices,  and  not 
matrices  whose  coefficients  are  fractional  rational  functions  of  X  as  would 
in  general  be  the  case  for  the  inverse  of  X-matrices.  Consequently,  if 
we  write  (1)  in  the  form 

(2)  a  =  c-^bd-i, 

we  see  that  the  relation  established  by  our  definition  between  the 

matrices  a  and  b  is  a  reciprocal  one,  as  is  implied  in  the  wording  of 

the  definition. 

*  We  use  here  and  in  what  follows  the  sign  =  between  two  X-matrices  to  denote 
that  every  element  of  one  matrix  is  identically  equal  to  the  corresponding  element  of 
the  other. 
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In  order  to  justify  the  definition  just  given,  we  begin  by  estab- 
lishing the 

Lemma.  If  a  and  b  are  n-rowed  \-matrices^  and  the  polynomial 
<^(X)  is  a  factor  of  all  the  i-roived  determinants  o/a,  it  is  a  factor  of  all 
the  i-rowed  determinants  of  ab  and  also  of  ba. 

For,  by  Theorem  5,  §  25,  every  ^-rowed  determinant  of  ab  and 
also  of  ba  is  a  homogeneous  linear  combination  of  certain  2-rowed 
determinants  of  a. 

Theorem  1.  If  a  and  b  are  equivalent  according  to  the  definition  of 
this  section^  they  are  also  equivalent  according  to  the  definition  of  §  91. 

For  in  this  case  there  exist  two  non-singular  \-matrices,  c  and 
d,  whose  determinants  are  constants,  such  that  relation  (1)  holds. 
Consequently,  by  Theorem  7,  §  25,*  a  and  b  have  the  same  rank  r. 
Let  D^X)  be  the  greatest  common  divisor  of  the  ^-rowed  determi- 
nants of  a,  where  i  ^  r.  By  our  lemma,  Di(\)  is  a  factor  of  all 
the  ^-rowed  determinants  of  ca,  and  therefore,  applying  the  lemma 
again,  it  is  a  factor  of  all  the  i-rowed  determinants  of  cad,  that  is,  of  b. 

We  can  infer  further  that  i)^(X)  is  the  greatest  common  divisor 
of  the  ^-rowed  determinants  of  b.  For  applying  to  relation  (2)  the 
reasoning  just  used,  we  see  that  the  greatest  common  divisor  of  the 
z-rowed  determinants  of  b  is  a  factor  of  all  the  z-rowed  determi- 
nants of  a,  and  cannot  therefore  be  of  higher  degree  than  DjiX). 

A  reference  to  Theorem  4,  §  91,  now  shows  us  that  a  and  b  are 
equivalent  according  to  the  definition  of  that  section. 

Theorem  2.  If  a.  and  b  are  equivalent  according  to  the  definition 
of  ^  91,  they  are  also  equivalent  according  to  the  definition  of  the  pres- 
ent section. 

We  begin  by  showing  that  if  we  can  pass  from  a  matrix  a  to  a 
matrix  a^  by  means  of  an  elementary  transformation,  one  of  the  fol- 
lowing relations  always  holds : 
(3)  ai  =  ca    or    a^^ad 

where  c  and  d  are  non-singular  matrices  whose  determinants  are 
independent  of  \.  To  prove  this  we  consider  in  succession  the 
elementary  transformations  of  the  forms  which  were  called  (a),  (5), 
(c),  in  Definition  1,  §  91. 

*  How  is  it  that  we  have  a  right  to  apply  this  theorem  to  X-matrices  ? 


276 


INTRODUCTION   TO  HIGHER   ALGEBRA 


(a)  Suppose  we  interchange  the  pth  and  ^th  rows.  This  can 
be  effected  by  forming  the  product  ca  where  the  matrix  c  may  be 
obtained  by  interchanging  the  pth.  and  ^'th  rows  (or  columns)  in  the 
unit  matrix 


0 
0 


0     0     0-..: 

Similarly  the  interchange  of  the  ^th  and  ^th  columns  of  a  may  be 
effected  by  forming  the  product  ac,  where  c  has  the  same  meaning 
as  before. 

In  each  of  these  cases,  c  may  be  regarded  as  a  non-singular 
X-matrix  with  constant  determinant,  since  its  elements  are  constants 
and  its  determinant  is  —  1. 

(b)  To  multiply  the  pth  row  of  a  by  a  constant  k,  we  may  form 
the  product  ca,  where  c  differs  from  the  unit  matrix  only  in  having 
k  instead  of  1  as  the  pth  element  of  the  principal  diagonal. 

Similarly,  we  multiply  the  pth  column  of  a  by  ^,  by  forming  the 
product  ac,  where  c  has  the  same  meaning  as  before. 

If  we  take  the  constant  k  different  from  zero,  c  may  be  regarded 
as  a  non-singular  X-matrix  with  constant  determinant. 

(c)  We  can  add  to  the  pth  row  of  a  <^(X)  times  the  ^'th  row  by 
forming  the  product  ca,  where  c  differs  from  the  unit  matrix  only  in 
having  </)(X)  instead  of  zero  as  the  element  in  the  ^th  row  and  ^'th 
column. 

Similarly  we  add  to  the  ^th  column  (/)(X)  times  the  pth  column 
by  forming  the  product  ac  where  c  has  the  same  meaning  as  before. 

The  matrix  c,  whose  determinant  is  1,  is  a  non-singular 
X-matrix. 

It  being  thus  established  that  one  of  the  relations  (3)  holds 
between  any  two  X-matrices  which  can  be  obtained  from  one  another 
by  an  elementary  transformation,  it  follows  that  two  matrices  a 
and  b  which  are  equivalent  according  to  the  definition  of  §  91  will 
satisfy  a  relation  of  the  form 


b  =  CpCp_i 


c^adida 


where  each  of  the  c's  and  d's  is  a  non-singular  X-matrix  of  constant 
determinant  which  corresponds  to  one  of  the  elementary  transforma- 
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tions  we  use  in  passing  from  a  to  b.  This  last  relation  being  of  the 
form 

b  =  cad, 

where  c  and  d  are  non-singular  X-matrices  with  constant  determi- 
nants, our  theorem  is  proved. 

We  have  now  completed  the  proof  that  our  two  definitions  of 
the  equivalence  of  \-matrices  are  coextensive. 

EXERCISES 

1.  If  a  denotes  the  matrix  in  Exercise  1,  §  91,  and  b  the  normal  form  of 
Theorem  2,  §  91,  for  this  matrix,  determine  two  A-matrices,  c  and  d,  such  that 
relation  (1)  holds. 

Verify  your  result  by  showing  that  the  determinants  of  c  and  d  are  constants. 

2.  Apply  the  considerations  of  this  section  to  matrices  whose  elements  are 
integers.     Cf.  Exercise  2,  §  91,  and  Exercise  3,  §  92. 

95.  Multiplication,  and  Division  of  \-Matrices.  We  close  this 
chapter  by  giving  a  few  developments  of  what  might  be  called  the 
elementary  algebra  of  X-matrices. 

Definition.  B^/  the  degree  of  a  X-matrix  is  understood  the  high- 
est degree  in  X  of  any  one  of  its  elements. 

For  a  X-matrix  of  the  k\X\  degree,  the  element  in  the  tth  row  and/th 
culamn  may  be  written     a^r^J^  +  <X^-i  H-  •••  +  af, 

and  at  least  one  of  the  coefficients  of  X*  (i.e.  one  of  the  %'s)  must 
be  different  from  zero.  If,  then,  we  denote  by  a^  the  matrix  of 
which  a]g^  is  the  element  which  stands  in  the  2th  row  and  yth  col- 
umn, we  get  the  theorem 

Theorem  1.  Every  X  matrix  of  the  kth  degree  may  be  written  in  the  form 
(1)  aoX^  +  aiX*-J+--4-a^  (ao=^0) 

where  aQ,  •  •  •  a^  are  matrices  with  constant  elements ;  and  conversely., 
every  expression  (1)  is  a  X-matrix  of  degree  h. 

Theorem  2.     The  product  of  two  X-matrices  of  degrees  k  and  I 

aoX*  +  a^X*-!  +  ..•  +  a^  (ao^  0) 

boV  +  biV-i  +  .-4-b,  (bo=^0) 

is  a  X-matrix  of  degree  h-\-l  provided  at  least  one  of  the  matrices  Blq  and 
bQ  is  non-singular. 
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For  this  product  is  a  \-matrix  of  the  form 


where  Cq  has  the  value  OQbQ  or  bQaQ  according  to  the  order  in  which 
the  two  given  matrices  are  multiplied  together.  By  Theorem  7, 
§  25,  neither  slq^q  nor  b^^aQ  is  zero  if  a.^  and  b^  are  not  both  singular. 
The  next  theorem  relates  to  what  we  may  call  the  division 
of  X-matrices. 

Theorem  3.  If  a.  and  b  are  two  \-matrices  and  if  b,  when  written 
in  the  form  (1),  has  as  the  coefficient  of  the  highest  power  of  X  a  non- 
singular  matrix^  then  there  exists  one,  and  only  one,  pair  of  \-matrices 
qi  and  r^  for  which  a  =  q^  b  +  r^ 

and  such  that  either  r^  =  0,  or  r^  is  a  \-matrix  of  lower  degree  than  b/ 
and  also  one  and  only  one  pair  of  \-matrices  qg  and  x^^for  which 

a  =  bq2  4-  r^ 

and  such  that  either  x^  =  0,  or  x^  is  a  X-matrix  of  lower  degree  than  b. 

The  proof  of  this  theorem  is  practically  identical  with  the  proof 
of  Theorem  1,  §  63. 

EXEHCISE 

Definition.  By  a  real  matrix  is  understood  a  matrix  whose  elements  are  real ; 
by  a  real  X-matrix,  a  matrix  tvhose  elements  are  real  polynomials  in  X;  and  by  a  real 
elementary  transformation,  an  elementary  transformation  in  which  the  constant  in  (b) 
and  the  polynomial  in  (c),  Definition  1,  §  91,  are  real. 

Show  that  all  the  results  of  this  chapter  still  hold  if  we  interpret  the  words 
matrix,  X-matrix,  and  elementary  transformation  to  mean  real  matrix,  real  X-matrix, 
and  real  elementary  transformation,  respectively. 


CHAPTER   XXI 

THE  EQUIVALENCE   AND   CLASSIFICATION   OF   PAIRS    OF 
BILINEAR   FORMS   AND   OF   COLLINEATIONS 

96.  The  Equivalence  of  Pairs  of  Matrices.  The  applications  of 
the  theory  of  elementary  divisors  with  which  we  shall  be  concerned 
in  this  chapter  and  the  next  have  reference  to  problems  in  which 
X-matrices  occur  only  indirectly.  A  typical  problem  is  the  theory 
of  a  pail"  of  bilinear  forms.  The  matrices  a  and  b  of  these  two  forms 
have  constant  elements,  and  WQ  get  our  X-matrix  only  by  consider- 
ing the  matrix  a  —  Xb  of  the  pencil  of  forms  determined  by  the  two 
given  forms.  It  will  be  noticed  that  this  matrix  is  of  the  first 
degree,  and  in  fact  we  shall  deal,  from  now  on,  exclusively  with 
X-matrices  of  the  first  degree. 

By  the  side  of  this  simplification,  a  new  difficulty  is  introduced, 
as  will  be  clear  from  the  following  considerations.  We  shall  subject 
the  two  sets  of  variables  in  the  bilinear  forms  to  two  non-singular 
linear  transformations  whose  coefficients  we  naturally  assume  to  be 
constants,  that  is,  independent  of  X.  These  transformations  have  the 
effect  of  multiplying  the  X-matrix,  a  —  Xb,  by  certain  non-singular 
matrices  whose  elements  are  constants  (cf.  §  36)  and  therefore,  by 
§  94,  carry  it  over  into  an  equivalent  X-matrix  which  is  evidently  of 
the  first  degree.  The  transformations  of  §  94,  however,  were  far 
more  general  than  those  just  referred  to,  so  that  it  is  not  at  all  ob- 
vious whether  every  X-matrix  of  the  first  degree  equivalent  to  the 
given  one  can  be  obtained  by  transformations  of  the  sort  just  re- 
ferred to  or  not. 

These  considerations  show  the  importance  of  the  following 
theorem  : 

Theorem  1.  If  a^,  a^^  b^,  bg  are  matrices  with  constant  elements 
of  which  the  last  two  are  non-singular^  and  if  the  \-matrices  of  the  first 

degree  mj  =  a^  -  Xbj,         m2  =  ag  —  Xb2 
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are  equivalent ^  then  there  exist  two  non- singular  matrices^  p  and  q, 
whose  elements  are  independent  of  X,  and  such  that 

(1)  m2  =  pmiq. 

Since  m^  and  nig  are  equivalent,  there  exist  two  non-singular 
X-matrices,  p^  and  q^,  whose  determinants  are  constants  and  such  that 

(2)  m2  =  Pomiqo. 

The  matrix  q^  has,  therefore,  an  inverse,  qj"^,  which  is  also  a 
X-matrix. 

Let  us  now  divide  p^^  by  m.^  and  q^  ^  by  nij  by  means  of  Theorem 
3,  §  95,  in  such  a  way  as  to  get  matrices  p^,  p,  s^,  s  which  satisfy  the 
relations 

(3)  Po  =  m2Pi  +  p,  qo^  =  Simi4-s, 

p  and  s  being  matrices  whose  elements  are  independent  of  X.  From 
(2)  we  s^et  _, 

Substituting  here  from  (3),  we  have 

vcL^^^m.^  -\-  pm^  =  nigS^mj  4-  ni2S, 
or 

(4)  mgCPi  -  Si)mi  =  m2  s  -  pnii. 

From  this  identity  we  may  infer  that  p^  =  s^  and  therefore 

(5)  m2S  =  pmi. 

For  if  pj  — Sj  were  not  identically  zero,  m2(px  — s^)  would  be  a 
X-matrix  of  at  least  the  first  degree  (cf .  Theorem  2,  §  95),  and  hence 
the  left-hand  side  of  (4)  would  be  a  X-matrix  of  at  least  the  second 
degree.  But  this  is  impossible,  since  the  right-hand  side  of  (4) 
is  a  X-matrix  of  at  most  the  first  degree. 

If  we  knew  that  p  and  s  were  both  non-singular,  our  theorem 
would  follow  at  once  from  (5) ;  for  we  could  write  (5)  in  the  form 

(6)  m2  =  pmiS-i 

and  p  and  s~^  would  be  non-singular  matrices  with  constant  elements. 
Moreover,  we  see  from  (5)  that  p  and  s  are  either  both  singular  or 
both  non-singular.  Our  theorem  will  thus  be  proved  if  we  can 
show  that  s  is  non-sing iilar. 
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For  this  purpose  let  us  substitute  in  the  identity 

for  qj^  its  value  from  (3), 

(7)  I  =  qoSiHii  +  q^s. 

Now  divide  q^  by  m^  by  means  of  Theorem  3,  §  95,  in  such  a  way  as 
to  get 

(8)  q^  =  qim2  +  q 

where  q  is  a  matrix  with  constant  elements. 
Substituting  this  value  in  (7),  we  have 

I  =  qoSjiiij  +  q^nigS  +  qs. 

Referring  to  (5),  we  see  that  this  may  be  written 

(9)  I-qs  =  (qoSi  +  qiP)mi. 

From  this  we  infer  that  qQS^  +  qjp  must  be  identically  zero,  and 
therefore 

(10)  I  =  qs. 

For  if  q^Sj  +  q^p  were  not  identically  zero,  the  right-hand  side  of 
(9)  would  be  a  X-matrix  of  at  least  the  first  degree,  while  the  left- 
hand  side  of  (9)  does  not  involve  X. 

Equation  (10)  shows  that  s  is  non-singular,  and  thus  our  theorem 
is  proved.  It  shows  us,  however,  also  that  q  is  non-singular,  and 
that  q  =  s~i,  so  that  equation  (6)  becomes  nig  =  pm^q. 

We  may,  therefore,  add  the  following 

Corollary.  The  matrices  p  and  q  whose  existence  is  stated  in  the 
above  theorem  may  he  obtained  as  the  remainders  in  the  division  of  p^ 
and  qQ  in  (2)  by  m.^  by  means  of  theformulce  : 

Po  =  nigpi  4- p,  qo  =  qim2  +  q. 

From  this  theorem  concerning  X-matrices  of  the  first  degree  we 
can  now  deduce  the  following  theorem  concerning  pairs  of  matrices 
with  constant  elements.  It  is  this  theorem  which  forms  the  main 
foundation  for  such  applications  of  the  theory  of  elementary  divisors 
as  we  shall  give. 

We  shall  naturally  speak  of  two  pairs  of  matrices  with  constant 
elements  ai,  b^  and  a2,  bg  as  equivalent  if  two  non-singular  matrices  p 
and  q  exist  for  which 

(11)  a2  =  paiq,  b2  =  pbiq. 
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Theorem  2.  If  a^,  bj  and  a2,  bg  are  two  pairs  of  matrices 
independent  of  X,  and  if  bj  and  bg  are  non-singular,  a  necessary 
and  sufficient  condition  that  these  two  pairs  of  matrices  be  equiva- 
lent is  that  the  two  \-matrices 


nij  =  a^  —  Xbj,  Td.^  =  3-2  —  Xb 


have  the  same  invariant  factors^  — or,  if  we  prefer,  the  same  elementary 
divisors. 

For  if  the  pairs  of  matrices  are  equivalent,  equations  (11) 
hold;  hence,  multiplying  the  second  of  these  equations  by  X 
and  subtracting  it  from  the  first,  we  have 

(12)  m2  =  pmiq, 

that  is  the  \-matrices  m.^  and  m.^  are  equivalent,  and  therefore  have 
the  same  invariant  factors,  and  the  same  elementary  divisors.  On 
the  other  hand,  it  follows  at  once  from  the  assumption  that  b^  and  bg 
are  non-singular,  that  m.^  and  m.^,  ^^^  non-singular,  and  hence  have 
the  same  rank.  Consequently  if  m^  and  m.^  have  the  same  invariant 
factors,  or  the  same  elementary  divisors,  they  are  equivalent.  Since 
they  are  of  the  first  degree,  there  must,  by  Theorem  1,  exist  two 
non-singular  matrices  p  and  q,  whose  elements  are  independent  of  X, 
which  satisfy  the  identity  (12).  From  this  identity,  the  two  equa- 
tions (11)  follow  at  once  ;  and  the  two  pairs  of  matrices  are  equivalent. 
Thus  the  proof  of  our  theorem  is  complete. 

A  case  of  considerable  importance  is  that  in  which  the  matrices 
bj  and  b2  both  reduce  to  the  unit  matrix  I.  In  this  case  m^  and  nig 
reduce  to  what  are  known  as  the  characteristic  matrices  of  a^  and  a2 
respectively,  according  to  the  following  definition : 

Definition.  Jf  a.  is  a  matrix  of  the  nth  order  with  constant  ele- 
ments and  I  the  unit  matrix  of  the  nth  order,  the  \-matrix 

A  =  a  -  XI 

is  called  the  characteristic  matrix  of  a;  the  determinant  of  A  is  called 
the  characteristic  function  of  a;  and  the  equation  of  the  nth  degree  in 
X  formed  hy  setting  this  determinant  equal  to  zero  is  called  the  char- 
acteristic equation  of  a. 
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We  can  now  deduce  from  Theorem  2  the  following  more  special 

result : 

Theorem  3.  -Zfa^  and  a.^  are  two  matrices  independent  of  X,  a  neces- 
sary and  sufficient  condition  that  a  non-singular  matrix  p  exist  such  that  * 

(13)  -       a2  =  pa^p-i 

is  that  the  characteristic  matrices  A^  and  A2  of  a^  and  a.^  have  the  same 
invariant  factors^  —  or,  if  we  prefer,  the  same  elementary  divisors. 

For  if  Aj  and  A2  have  the  same  invariant  factors  (or  ^lementary 

divisors),  there  exist,  by  Theorem  2,  two  non-singular   matrices  p 

and  q  such  that  <,««/,  t      txT/, 

^  ^2  =  pa^q,  I  =  piq. 

The  second  of  these  equations  shows  us  that  q  =  p~i ;  and  this 
value  being  substituted  in  the  first,  we  see  that  p  is  the  matrix  whose 
existence  our  theorem  asserts. 

That,  on  the  other  hand,  A^  and  Ag  have  the  same  invariant  factors 
and  elementary  divisors  if  equation  (13)  is  fulfilled,  is  at  once  obvious. 

97.  The  Equivalence  of  Pairs  of  Bilinear  Forms.  Suppose  we 
have  a  pair  of  bilinear  forms  in  2  n  variables 

n  n 

</>!  =  ^a'ijXiT/j,  sjr^  =  l.b'ijXiyj, 

and  also  a  second  pair 

n  n 

and  let  us  assume  that  i/r^  and  y^^  are  non-singular.  We  will  in- 
quire under  what  conditions  the  two  pairs  of  forms  are  equivalent, 
that  is,  under  what  conditions  a  first  non-singular  linear  transforma- 
tion for  the  a;'s  and  a  second  for  the  ?/'s. 


X-^  —  ^11*^1  "^    **'     '   ^In'^n 


^n  —  ^nl  ^1    I      "  *      I    (^nn  •^n 


ryi  =  ^ii«/i+  •••  +d\nyn 


Vn —  ^wl^l  4"    •••   "T  Ctnnyn 


can  be  found  which  together  carry  over  <^^  into  </>2  and  i/r^  into  '>^r^. 

*  Two  matrices  connected  by  a  relation  of  the  form  (13)  are  sometimes  called 
similar  matrices.  This  conception  of  similarity  is  evidently  merely  a  special  case  of 
the  general  conception  of  equivalence  as  defined  in  §  29,  the  transformations  considered 
being  of  the  form  (13)  instead  of  the  more  general  form  usually  considered  in  this 
chapter  and  the  last. 
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If  we  denote  the  conjugate  of  the  matrix  c  by  c'  and  the  matrices 
of  <^i,  i/r^,  02^  i/rg  by  Rj,  bj,  3^,  bg  respectively,  we  know,  by  Theorem  1, 
§  36,  that  the  transformations  c,  d  carry  over  (j>^  and  yjr^  into  forms 
with  matrices  c^a^d,  c^b^d 

i 

respectively ;  so  that,  if  these  are  the  forms  <^2  ^^^  "^2'  ^^  have 
(1)  Eg  =  c'ajd,  bg  =  c'b^d. 

Consequently,  by  Theorem  2,  §  96,  the  two  X-matrices 

fti  —  Xbi,  3.2  —  ^^2 

have  the  same  invariant  factors  and  elementary  divisors. 

Conversely,  by  the  same  theorem,  if  these  two  X-matrices  have 
the  same  invariant  factors  (or  elementary  divisors),  two  constant 
matrices  c'  and  d  exist  which  satisfy  both  equations  (1);  and  hence 
there  exists  a  linear  transformation  of  the  a;'s  and  another  of  the  ?/'s 
which  together  carry  over  cj)^  into  (/)2  and  yjr^  into  yjr^.  Thus  we 
have  proved  the 

Theorem.  If  cf)^^  ylr^  and  (^21  '^2  ^'^^  ^^^  pairs  of  bilinear  forms  in  2  n 
variables  of  which  yjr^  and  ^jr^  are  non-singular^  a  necessary  and  sufficient 
condition  that  these  two  pairs  of  forms  be  equivalent  is  that  the  matrices 
of  the  two  pencils  ^^  _  ^^^^  ^^  _  ^^^ 

have  the  same  invariant  factors^ — or,  if  we  prefer,  the  same  elementary 
divisors.* 

EXERCISE 

Prove  that  the  theorem  of  this  section  remains  true  if  the  bihnear  forms 
^1,  xj/i,  <f>2,  ip2  are  real  and  the  term  equivalent  is  understood  to  mean  equivalent  with 
regard  to  real  7ion-singular  linear  transformations. 

98.  The  Equivalence  of  CoUineations.  A  second  important  appli- 
cation of  the  theory  of  elementary  divisors  is  to  the  theory  of  col- 
lineations.  For  the  sake  of  simplicity  we  will  consider  the  case  of 
two  dimensions 


r^f 


Q.    <X2=  <^21*^1  ~l~  ^22*^2  ~^  ^23*^3' 
f         t 

X^  =  a^^X^  +  ^32^2    ■"  ^^33 "^3' 

although  the  reasoning  will  be  seen  to  be  perfectly  general. 

*  For  the  sake  of  brevity,  we  shall,  in  future,  speak  of  these  invariant  factors  and 
elementary  divisors  as  the  invariant  factors  and  elementary  divisors  of  the  pairs  of 
for  inn  0i,  \px  and  02,  'A2  respectively. 
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We  have  so  far  regarded  a  collineation  merely  as  a  means  of 
transforming  certain  geometric  figures.  It  is  possible  to  adopt  an- 
other point  of  view,  and  to  study  the  collineation  in  itself  with 
special  reference  to  the  relative  position  of  points  before  and  after 
the  transformation.  Thus  suppose  we  have  a  figure  consisting  of 
the  points  A^^  A^^  ••• ,  finite  or  infinite  in  number,  and  suppose  these 
points  are  carried  over  by  the  collineation  a  into  the  points  A[^ 
A^^  ••• .  These  two  sets  of  points  together  form  a  geometric  figure. 
It  is  the  properties  of  such  figures  as  this  that  we  call  the  properties 
of  the  collineation.  Such  properties  may  be  either  projective  or 
metrical.  Thus  it  would  be  a  metrical  property  of  a  collineation  if 
it  carried  over  some  particular  pair  of  perpendicular  lines  into  a  pair 
of  perpendicular  lines ;  it  would  be  a  projective  property  of  the 
collineation  if  it  carried  over  some  particular  triangle  into  itself.  We 
shall  be  concerned  only  with  the  projective  properties  of  collineations. 

As  an  example,  let  us  consider  the  fixed  points  of  the  collineation, 
that  is  points  whose  initial  and  final  position  is  the  same.  In  order 
that  (a^j,  2^2,  ^3)  be  a  fixed  point  it  is  necessary  and  sufficient  that 

that  is,  substituting  in  a,  that  a  constant  X  exist  such  that, 

I  1*11    ~~~    Ai  )CC-t    — 1~  12       2   "T"  Cl-inXo     —    v/, 

(Jon-itV-i    "|~  CJl/nnXn      [~     I  Ctoq  '^~    Ai  )«t^q   — ^    V/. 

The  matrix  of  this  system  of  equations  is  precisely  what  we  have 
called  the  characteristic  matrix  of  the  matrix  a  of  the  linear  trans- 
formation. The  characteristic  function  is  a  polynomial  of  the  third 
degree  in  X  which,  when  equated  to  zero,  has  one,  two,  or  three  dis- 
tinct roots.  Let  X^  be  one  of  these  roots.  When  this  is  substituted 
in  (1),  these  equations  are  satisfied  by  the  coordinates  of  one  or  more 
points, — the  fixed  points  of  the  collineation  a.  The  number  and 
distribution  of  these  fixed  points  give  an  important  example  of  a 
projective  property  of  a  collineation ;  and  it  is  readily  seen  that 
collineations  may  have  wholly  different  properties  in  this  respect, 
one  having  three  fixed  points,  another  two,  and  still  another  an 
infinite  number. 

Coming  back  now  to  the  two  sets  of  points  A^^  A^^  •••  and  ^^, 
^2^  •••  which  correspond  to  one  another  by  means  of  the  coUinea- 
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tion  a  (which  may  be  singular  or  non-singular),  let  us  subject  all  these 
points  to  a  non-singular  collineation  c,  which  carries  over  A^,  A^-,  ••• 
into  ^j,  B^,  '••  and  A[,  A'^,  •••  into  B[,  Bi^,  •••  respectively.  The  fig- 
ure formed  by  the  ^'s  will  have  the  same  projective  properties  as 
that  formed  by  the  ^'s;  and  consequently  if  we  can  find  a  collinea- 
tion b  which  carries  over  B^^  B^^  •••  into  B[^  Bi^i  •••,  this  collineation 
will  have  the  same  projective  properties  as  the  collineation  a.  Such 
a  collineation  is  clearly  given  by  the  formula 

(2)  b  =  cac-i 

since  c"^  carries  over  the  points  Bi  into  the  points  Ai,  a  then  carries 
over  these  into  A^,  and  c  carries  over  the  points  A^  into  the  points  j&J. 
Since  two  collineations  a  and  b  related  by  formula  (2)  are  indis- 
tinguishable so  far  as  their  projective  properties  go  (though  they 
may  have  very  different  metncal  properties),  we  will  call  them 
equivalent  according  to  the  following 

Definition.  Tivo  collineations  a  and  b  shall  he  called  equivalent 
if  a  non-singular  collineation  c  exists  such  that  relation  (2)  is  fulfilled. 

A  reference  to  Theorem  3,  §  96,  now  gives  us  the  fundamental 
theorem : 

Theorem.  A  necessary  and  sufficient  condition  that  two  collinea- 
tions he  equivalent  is  that  their  characteristic  matrices  have  the  same  in- 
variant factors, —  or,  if  we  prefer,  the  same  elemeyitary  divisors. 

EXERCISES 

1.  If  Pi,  P^,  '•'  Pk  are  fixed  points  of  a  non-singular  collineation  in  space  of 
n  —  1  dimensions  which  correspond  to  k  distinct  roots  of  the  characteristic  equa- 
tion, prove  that  these  points  are  linearly  independent. 

2.  Discuss  the  distribution  of  the  fixed  points  of  a  collineation 

(a)  in  two  dimensions, 
(h)  in  three  dimensions, 
for  all  possible  cases  of  non-singular  collineations. 

3.  Discuss  the  distribution  of 

(a)  the  fixed  lines  of  a  collineation  in  tw^o  dimensions, 

(b)  the  fixed  planes  of  a  collineation  in  three  dimensions, 

for  all  possible  cases  of  non -singular  collineations ;  paying  special  attention  to  their 
relation  to  the  fixed  points. 

4.  Two  real  collineations,  a  and  b,  may  be  said  to  be  equivalent  if  there  exists 
a  real  non -singular  collineation  c  such  that  b  =  cac"^. 

With  this  understanding  of  the  term  equivalence,  show  that  the  theorem  of  the 
present  section  holds  for  real  collineations. 
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99.  Classification  of  Pairs  of  Bilinear  Forms.  We  consider  again 
the  pair  of  bilinear  forms 

n  n 

of  which  we  assume  the  second  to  be  non-singular,  and  form  the 

X- matrix. 

(1)  a-Xb. 

Using  a  slightly  different  notation  from  that  employed  in  §  92,  we 
will  denote  the  elementary  divisors  of  (1)  by 

(\  -  \)%     (\  -  \)%   (\-  X,yS         (^1  +  ^2  +  •  •  •  +  ^/fc  =  ^> 

so  that  the  linear  factors  A-  —  X^  need  not  all  be  distinct  from  one 
another.  The  most  important  thing  concerning  these  elementary 
divisors  is,  for  many  purposes,  their  degrees^  e^^  e^^  •••  e^.  When  we 
wish  to  indicate  these  degrees  without  writing  out  the  elementary 
divisors  in  full,  we  will  use  the  symbol  \e^  e^  •  •  •  Cj^^  called  the  char- 
acteristic of  the  X-matrix  (1),  of  of  the  pair  of  forms  <^,  i/r.  It  will  be 
seen  that  this  characteristic  is  a  sort  of  arithmetical  invariant  of  the 
pair  of  bilinear  forms,  since  two  pairs  of  bilinear  forms  which  are 
equivalent  necessarily  have  the  same  characteristic.  The  converse 
of  this,  however,  is  not  true,  since  for  the  equivalence  of  two  pairs  of 
bilinear  forms  the  identity  of  the  elementary  divisors  themselves, 
not  merely  the  equality  of  their  degrees,  is  necessary. 

All  pairs  of  bilinear  forms  which  have  the  same  characteristic  are 
said  to  form  a  category.  Thus,  for  example,  in  the  case  of  pairs  of 
bilinear  forms  in  six  variables  we  should  distinguish  between  three 
categories  corresponding  to  the  three  characteristics, 

[1  1  1],  [2  1],  [3], 
which  are  obviously  the  only  possible  ones  in  this  case.  In  fact,  we 
must  inquire  whether  these  three  categories  really  all  exist.  This 
question  we  answer  in  the  affirmative  by  writing  down  the  following 
pairs  of  bilinear  forms  in  six  variables  which  represent  these  three 
categories: 

T     n  1  1 1    i  ''^i^i^i  ~'~    2^2^2  "f"  ^3^32/3' 


Xi  —  X 
0 
0 


0 
Xo  —  X 


0 
0 

Xg  X 
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II.   [2  1J 


^l^l^l  +  ^1^2^2  +  ^1^2  +  ^2^3y3' 


^iUl  ~^       ^2^2 


0 
0 


X, 


0 


~^    '^sl/s'f 


0 

0 

Xg  —  X 


X]^  —  X 

0 

0 


X-j^  —  X 
0 


0 

1 

Xj-X 


The  pairs  of  bilinear  forms  we  have  just  written  down  do  more 
than  merely  establish  the  existence  of  our  three  categories.  They 
establish  the  fact  that  not  only  the  degrees  of  the  elementary  divisors 
are  arbitrary  (subject  merely  to  the  condition  that  their  sum  be 
three),  but  that,  subject  to  this  restriction,  the  elementary  divisors 
themselves  may  be  arbitrarily  chosen.  They  are,  moreover,  normal 
forms  to  one  or  the  other  of  which  every  pair  of  bilinear  forms  in 
six  variables,  of  which  the  first  is  non-singular,  may  be  reduced  by 
non-singular  linear  transformations. 

The  general  theorem  here  is  this: 


Theorem.  If  X^,  X2,  •  •  •  X^^.  are  any  constants^  equal  or  unequal^  and 
ej^  are  any  positive  integers  ivJiose  sum  is  n^  there  exist  pairs  of 
bilinear  forms  in  2  n  variables^  the  second  form  in  each  pair  being  non- 
singular^  which  have  the  elementary  divisors 


^V>  ^2' 


(2) 


(X-Xi)%     {x-\^)% 


(x-x,y*. 


The  proof  of  this  theorem  consists  in  considering  the  pair  of 
bilinear  forms 


(3) 


e,+e„ 


+  +(     2    \^^y^-\-     S    Xi_^, 


n 


^yjr  =  x^y^  +  2-2^2  +  •••  +  x^y^, 
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of  which  the  second  is  non-singular.     These  forms  have  a  X- matrix 
which  may  be  indicated,  for  brevity,  as 


(4) 


Ml 


M, 


M. 


where  the  letters  Mj,  •••  M^t  represent  not  single  terms  but  blocks  of 
terms  ;  M^  standing  for  the  matrix  of  order  e^ 

\-\        1        0    ...        0 
0        X,  -X    1     ...         0 


M,= 


0 


0 


0 


X.  -  X 


while  all  the  terms  of  the  matrix  (4)  are  zero  which  do  not  stand  in 
one  of  the  blocks  of  terms  M,.  The  elementary  divisors  of  (4)  are, 
as  we  see  by  a  reference  to  §  93  (Formula  (1)  and  Theorem  2),  pre- 
cisely the  expressions  (2).     Thus  our  theorem  is  proved. 

A  reference  to  §  97  shows  that  formula  (3)  is  a  normal  form  to 
which  every  pair  of  bilinear  forms  in  2n  variables  with  the  ele- 
mentary divisors  (2)  can  be  reduced.* 

*  Many  other  normal  forms  might  be  chosen  in  place  of  (3).  Thus,  for  instance, 
we  might  have  used  in  place  of  (3)  the  form 


(30 


e.+e^-l 


1  1  e^+1  e^+1 

n-ek+l  n-ejc+l 

Xp  =  "ZCiXiye^-i+l  +  ^C2Xiy2e^+e^-i+l  +  '^Cp.Xiy2e^+2e^+e^~t+l 
1  e,+l  e^+e^+l 

n 

+    ...    +  i:CkXiy2n-ef,-i+h 
n—ejf+l 

where  the  constants  Ci,  ♦.•  c^,  d^  •.•  dk  may  be  chosen  at  pleasure  provided,  merely, 
that  none  of  them  are  zero.    For  instance,  they  may  all  be  assigned  the  value  1. 
u 
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Let  us  now  return  to  the  classification  of  pairs  of  bilinear  forms. 
For  a  given  number,  2  n,  of  variables  we  have  obviously  only  a  finite 
number  of  categories.  We  may  subdivide  these  categories  into 
classes  by  noticing  which,  if  any,  of  the  elementary  divisors  corre- 
spond to  the  same  linear  factor.  This  we  can  indicate  in  the  char- 
acteristic by  connecting  by  parentheses  those  integers  which  are  the 
degrees  of  elementary  divisors  corresponding  to  one  and  the  same 
linear  factor.     Thus,  in  the  case  ?^  =  8,  the  characteristic 

[(2  1)(1  1  1)2] 

would  indicate  that  the  X-matrix  has  just  three  distinct  linear  fac- 
tors ;  that  to  one  of  these  there  correspond  two  elementary  divisors 
of  degrees  two  and  one  respectively,  to  another  three  elementary 
divisors  of  the  first  degree,  and  to  the  last  a  single  elementary 
divisor  of  degree  two. 

Two  pairs  of  bilinear  forms  which  are  equivalent  belong  neces- 
sarily to  the  same  class,  but  two  pairs  of  bilinear  forms  which  be- 
long to  the  same  class"are  not  necessarily  equivalent. 

To  illustrate  what  has  just  been  said,  let  us  again  consider  the 
case  ^  =  3.  Here  we  have  now,  instead  of  three  categories,  six 
classes,  which  are  exhibited  in  the  following  table : 


a 

b 

c 

I. 

[111] 

[(11)1] 

[(1  1  1 )] 

II. 

[2  1] 

III. 

[3; 

The  X-matrix  of  this  pair  of  forms  may  be  written  in  the  form  (4),  where,  how- 
ever, Mi  now  stands  for  the  matrix  of  order  Ci : 


Mi  = 


0  di  Ci(\i  -  X) 

di    Ci(\i  —  X)  0 


Ci(\i  -  X) 


0 


0 


0 


It  will  be  noticed  that  the  matrices  Mj,  and  therefore  also  the  bilinear  forms  (3'), 
are  symmetrical,  a  fact  which  will  make  this  normal  form  important  when  we  come  to 
the  subject  of  quadratic  forms  in  the  next  chapter. 

Constants  similar  to  the  constants  c,  and  di  which  we  have  introduced  in  (3') 
might  also  have  been  introduced  in  (3). 
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The  three  classes  la,  16,  Ic  form  together  the  category  I,  and  are 
all  represented  by  the  normal  form  given  for  that  category  above, 
the  only  difference  being  tliat  in  class  la  the  three  quantities  X^,  Xg, 
Xg  are  all  distinct,  in  class  lb  two,  and  only  two,  of  them  are  equal, 
while  in  class  Ic  they  are  all  equal.  Similarly  category  II  is  now 
divided  into  two  classes,  lla  and  115,  for  both  of  which  the  normal 
form  of  category  II  holds  good,  X^  and  Xg  being,  however,  different 
in  that  normal  form  for  class  lla  and  equal  for  class  115.  Finally 
category  III  consists  of  only  a  single  class. 

For  some  purposes  it  is  desirable  to  carry  this  subdivision  still 
farther.  The  second  of  our  two  bilinear  forms,  yfr^  has  been  assumed 
throughout  to  be  non-singular.  The  first,  </>,  may  be  singular  or 
non-singular ;  and  it  is  readily  seen  that  a  necessary  and  sufficient 
condition  that  (p  be  singular  is  that  one,  at  least,  of  the  constants  X^ 
which  enter  into  the  linear  factors  of  the  X-matrix  be  zero.  Thus  it 
will  be  seen  that  in  a  single  class  we  shall  have  pairs  of  bilinear 
forms  both  of  which  .are  non-singular  and  others  one  of  which  is 
singular,  and  we  may  wish  to  separate  into  different  sub-classes 
the  pairs  of  forms  which  belong  to  one  or  the  other  of  these  two 
cases. 

Let  us  go  a  step  farther  in  this  same  direction,  and  inquire  how 
the  rank  of  cf)  is  connected  with  the  values  of  the  constants  X^.  We 
notice  that  the  matrix  of  <f)  is  equal  to  the  matrix  of  the  pencil  c^  — X-i/r 
when  X  =  0.  Accordingly,  if  </>  is  of  rank  r,  every  (r  +  l)-rowed 
determinant  of  the  matrix  of  (^  —  \^jr  will  be  divisible  by  X,  while  at 
least  one  r-rowed  determinant  of  this  matrix  is  not  divisible  by  X. 
It  is  then  necessary,  as  we  see  by  a  reference  to  the  definition  of 
elementary  divisors  (cf.  the  footnote  to  Definition  3,  §  92),  that  just 
7^  —  r  of  the  constants  X^  which  enter  into  the  elementary  divisors 
should  be  zero.  Since  the  converse  of  these  statements  is  also  true, 
we  may  say  that  a  necessary  and  sufficient  condition  that  the  form  <f) 
be  of  rank  r  is  that  just  n—  r  of  the  elementary  divisors  be  of  the  form 
\^i.  Let  us,  in  the  characteristic  [e^  gg  ***  ^k]-)  place  a  small  zero 
above  each  of  the  integers  e^  which  is  the  degree  of  such  an  elemen- 
tary divisor ;  and  regard  two  pairs  of  bilinear  forms  as  belonging  to 
a  single  class  when,  and  only  when,  their  characteristics  coincide  in 
the  distribution  of  these  zeros  as  well  as  in  other  respects.  Here 
again  two  equivalent  pairs  of  forms  will  always  belong  to  the  same 
class,  but  the  converse  will  not  be  true. 
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As  an  illustration,  let  us  again  take  the  case  n  =  S.  We  have 
now  fourteen  classes  instead  of  six. 

[1 1 1],  [(1  1)1],  [(1 1 1)],  [2  1],  [(2  1)],  [3],  (r  =  3), 

[111],  [(11)1],  [21],  [21],  [3J,  (r=2), 

[(1  V].  [(2  i)],  (r=l). 

[(ill)],  (r=0). 

We  have  indicated,  in  each  case,  the  rank  r  of  the  form  (f).  Thus 
in  the  first  six  cases  (j)  is  non-singular ;  in  the  next  five  it  is  of  rank 
2,  etc. 

EXERCISES 

1.  Prove  that  there  exist  pairs  of  real  bilinear  forms  in  2  n  variables  of  which 
the  second  is  non-singular,  and  which  have  the  elementary  divisors 

(X-Ai)S       (X-A2)%     (X-XkYk  (Ci  +  e2+   -  +e*  =  n), 

provided  that  such  of  these  elementary  divisors  as  are  not  real  admit  of  arrange- 
ment in  conjugate  imaginary  pairs.     (Cf.  Exercises  1,  2,  §  93.) 

2.  Classify  pairs  of  real  bilinear  forms  in  six  variables  (the  second  form 
in  each  pair  being  non-singular),  distinguishing  between  real  and  imaginary 
elementary  divisors. 

100.  Classification  of  Collineations.  The  classification  of  pairs 
of  bilinear  forms  which  we  gave  in  the  last  section  may  obviously 
be  regarded,  from  a  more  general  point  of  view,  as  a  classification 
of  pairs  of  matrices,  the  second  matrix  of  each  pair  being  assumed 
to  be  non-singular.  From  this  point  of  view  it  admits  of  applica- 
tion to  the  classification  of  collineations,  since,  as  we  saw  in  §  98,  to 
every  collin cation  corresponds  a  pair  of  matrices  of  which  one  is 
non-singular,  namely  the  unit  matrix  I  and  the  matrix  of  the  linear 
transformation.  Moreover,  the  normal  form  (3)  of  §  99  is  precisely 
adapted  to  the  treatment  of  the  more  special  kind  of  equivalence 
which  we  have  to  consider  here,  since  the  matrix  of  the  form  -i/r  is 
precisely  the  unit  matrix.  We  may  therefore  state  at  once  the 
fundamental  theorem  : 

Theorem  1.  7f  Xj,  Xg'  ••*  ^a  ^^^  ^^^  constants^  equal  or  unequal^ 
and  g J,  ^21  •  •  •  ^yfc  ^^^  positive  integers  whose  sum  is  n^  there  exists  a  col- 
lineation  in  space  of  n  —  1  dimensions  whose  characteristic  matrix  has 
the  elementary/  divisors 

(X  -  \)%    (X  -  \^)\    (X  -  x,y*. 
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To  this  we  may  add 

Theorem  2.     Every  collineation  of  the  kind  mentioned  in  Theorem 
1  is  equivalent  to  the  collineation  whose  matrix  is 


M, 


M, 


where  M^  stands  for  the  matrix  of  order  e^, 

Xi    1     0     ...    0 


M,= 


0      X,     1 


0 


0      0      0 


We  thus  get  a  classification  of  collineations  into  categories  and 
a  subdivision  of  these  categories  into  classes  precisely  as  in  §  99. 
For  instance,  in  the  case  n  =  S  (collineations  in  the  plane),  we  have 
three  categories  whose  characteristics  and  representative  normal 
forms  we  give: 


I.   [Ill] 


II.   [21] 


III.   [3] 


X-i  —  A--!  X-i 

Jbn   — 


^2*^2 


,Xn    


7   _ 


XgO^g, 


X-^  —  K-^X-^  -f-  Xc^^ 


*^2  — 
X^  = 


^1^2 


^2*^3' 


X-i     A,-!  X-i    ~f~   Xn 

1*^2    '    *^3 


•^2  — 

^Xn    = 


^1^3* 


These  categories  we  should  then  subdivide  either  into  six  classes 
as  on  page  290  or  into  fourteen  classes  as  on  page  292.  This  latter 
classification  is  the  desirable  one  in  this  case.  We  proceed  to  give  a 
list  of  these  fourteen  classes  with  a  characteristic  property  of  each. 
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That  the  normal  forms  of  the  coUineations  have  these  properties 
will  be  at  once  evident,  and  from  this  it  follows  that  all  the  coUinea- 
tions of  the  class  have  the  property  in  question,  since  the  properties 
mentioned  are  obviously  all  projective.  That  the  properties  men- 
tioned are  really  characteristic  properties,  that  is,  serve  to  distin- 
guish one  class  from  another,  can  only  be  seen  a  posteriori^  by  noticing 
that  no  one  of  the  properties  mentioned  is  shared  by  two  classes. 
[1 1 1]      Three  distinct  non-collinear  fixed  points.* 

[(1 1)1]  Every  point  of  a  certain  line  and  one  point  not  on 
this  line  are  fixed. 

[(1 1 1)]  The  identical  collineation. 

[2  1]  Two  distinct  fixed  points. 

[(2  1)]  Every  point  of  a  certain  line  is  fixed. 

[3]  One  fixed  point. 

In  all  these  cases  the  collineation  is  non-singular.  The  remain- 
ing coUineations  are  singular.  In  the  next  three,  one  point  P  of  the 
plane  is  not  transformed  at  all,  while  all  other  points  go  over  on  to 
a  line  p  which  does  not  pass  through  P,  and  every  one  of  whose 
points  corresponds  to  an  infinite  number  of  points. 

0 

[1 1 1]     There  are  two  fixed  points  on^. 
[(1 1)  1]   Every  point  on  p  is  fixed. 

0 

[2  1]         One  fixed  point  on  p. 

In  the  next  two  cases  one  point  P  is  not  transformed  at  all, 
while  all  other  points  go  over  on  to  a  line  p  which  passes  through  P, 
and  every  one  of  whose  points  corresponds  to  an  infinite  number  of 
points. 

0 

[2  1]   One  fixed  point. 

0 

[3]      No  fixed  point. 

The  remaining  coUineations  are  so  simple  that  they  are  not  merely 
characterized,  but  completely  described,  by  the  property  we  mention. 

0     0 

[(1 1)  1]  The  points  on  a  certain  line  are  not  transformed.  All 
other  points  go  over  into  a  single  point  which  does 
not  lie  on  this  line. 

*  It  should  be  understood  here  and  in  what  follows  that  the  fixed  points  which 
are  mentioned  are  the  only  fixed  points  of  the  collineation  in  question. 
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0    0 

[(2  1)]        The  points  on  a  certain  line  are  not  transformed.     All 
other  points  go  over  into  a  single  point  on  this  line. 

0    0    0 

[(1 1 1)]     No  point  in  the  plane  is  transformed. 
This  last  case  is  of  course  not  a  transformation  at  all. 


EXERCISES 

1.  Classify,   in   a   similar   maimer,   the    projective    transformations   in   one 
dimension. 

2.  Classify  the  collineations  in  space  of  three  dimensions. 

3.  Classify  the  real  projective  transformations  in  space  of  one,  two,  and  three 
dimensions.     (Cf.  Exercises  1,2,  §  99.) 


CHAPTER  XXII 

THE    EQUIVALENCE    AND    CLASSIFICATION    OF    PAIRS    OF 

QUADRATIC   FORMS 

101.  Two  Theorems  in  the  Theory  of  Matrices.  In  order  to  jus- 
tify the  applications  we  wish  to  make  of  the  theory  of  elementary 
divisors  to  the  subject  of  quadratic  forms,  it  will  be  necessary  for  us 
to  turn  back  for  a  moment  to  the  general  theory  of  matrices. 

Definition.     If  (^{x)  is  a  polynomial : 

(\>{x)  =  aQX^-\-  a^ a;™-i  +    •  •  •    +  a^.^x  +  a^, 
then  a^x^  +  a^jL^~^ -\-    •••    -fa^_jX4-«„iI 

is  called  a  polynomial  in  the  matrix  x  and  is  denoted  by  <^(x).* 

We  come  now  to  one  of  the  most  fundamental  theorems  in  the 
whole  theory  of  matrices : 

Theorem  1.  If'  a.  is  a  matrix^  and  (j>(X)  its  characteristic  func- 
tion, then  ^(^a)  =  0. 

This  equation  is  called  the  Hamilton-Cayley  equation. 
Let  c  be  the  characteristic  matrix  of  a : 

c  =  a  —  XI. 

This  being  a  \-matrix  of  the  first  degree,  its  adjoint  C  will  be  a 
X-thatrix  of  degree  not  higher  than  ?i— 1,  if  n  is  the  order  of  the  matrix  a : 

(1)  C  =  C„_jX''-i  +  C„.2\»-2  +    -   +  Co- 
We  may  also  write 

(2)  (^(X)  =  A'„X-  +  ^.-iX«-i+    -    +K 

Now  referring  to  formula  (5),  §  25,  we  see  that 

aC  -  XC  =  <^(X)I. 

*  It  should  be  noticed  that,  according  to  this  definition,  the  coefficients  of  a  poly- 
nomial in  X  are  scalars.  Contrast  this  with  a  X-matrix,  in  which  the  coefficients  are 
matrices  and  the  variable  a  scalar.  Both  of  these  conceptions  would  be  included  in 
expressions  of  the  form  : 

aox'^bo  4-  aix'»-ibi  +  •••  +  a„»_ixbm_i  +  a^- 
296 
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Substituting  here  from  (1)  and  (2),  we  have,  on  equating  corre- 
sponding powers  of  \,     p  _  z.  t 

clV/1     V/Q    ^    A^i  X, 

aC/2         ^1  ^^  ^2^' 


If  we  multiply  these  equations  in  succession  by  I,  a,  a^,  •••  a", 
and  add,  the  first  members  cancel  out,  and  we  get 

Jc^^I  +  k^si  +  k^a?  +   •••   4-  A:„a^  =  0. 

This  is  precisely  the  equation 

<Kaj=0 
which  we  wished  to  establish.^ 

As  a  means  of  deducing  our  second  theorem,  we  next  establish  a 
lemma  which  relates  merely  to  scalar  quantities. 

Lemma.  If  ^/^(a:)  is  a  polynomial  of  the  nth  degree  (n  >  0)  whose 
constant  term  is  not  zero^  there  exists  a  polynomial  x(^)  ^f  degree  less 
than  n  such  that  /   /  wo 

is  divisible  by  -^(x). 

Let  x—a^x—b\x  —  c,  •••  be  the  distinct  linear  factors  of  "^{x)^ 
so  that  we  may  write 

yjr(x)  =  k(x  —  ay(x—by(x  —  c)y  •••  («  +  yS  +  7+    •••    =  n). 

None  of  the  constants  a^  b,  c,  •••  are  zero,  since,  by  hypothesis,  the 
constant  term  of  yjr  is  not  zero.  Let  us,  further,  denote  by  yj^iix)  the 
polynomial  obtained  from  yjr  by  omitting  the  factor  (x  —  a)%  by 
yjr2{x)  the  polynomial  obtained  from  yjr  by  omitting  the  factor  (x—by^ 
etc.,  and  finally  let  us  form,  with  undetermined  coefficients,  the 
polynomials 

A(x)  =  AQ  +  A^(x-a)-{-  Alx  -af-\-   •  •  •   +  A,_^{x  -  ay-\ 

B(x)=B,-\-B,{x-b)-{-B^(x-bf-\-   ...   -^B^_,{x-by-\ 

C{x)  =  (7o  -f  C^{x  -c)+  Clx  -cf-\-   ...   +  Q^_lx  -  c)y-\ 
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From  these  polynomials  we  now  form  the  polynomial 
X{x)  =  A{x)flx)  +  B(x)ylr,{x)  +  C{x)ylr^{x)  +    •  •  • 

whose  degree  can  obviously  not  exceed  w  —  1.  We  wish  to  show 
that  the  coefficients  A^^  B^^  •••  can  be  so  determined  that  this  poly- 
nomial X(x)  satisfies  the  conditions  of  our  lemma. 

Since  '^^^  i/rg,  •  •  •  are  all  divisible  by  {x  —  a)%  a  necessary  and 
sufficient  condition  that  (x(x)f  —  xhe  divisible  by  this  factor  is  that 
the  polynomial  ^^^  ^  {A{x)f(f,ix)f  -  x 

be  divisible  by  (x  —  ay.     We  have 

(l>(a)  =  Alk\a-bf^(a-cfy  •••-«. 

In  order  that  (f)(x)  be  divisible  by  a;  —  <x  it  is  therefore  necessary  and 
sufficient  that 

^^^  ^^  ^ k\a-hf\a-cfy  .-.' 

Neither  numerator  nor  denominator  here  being  zero,  we  thus 
obtain  two  distinct  values  for  A^^  both  different  from  zero.  If  we 
give  to  A(^  one  of  these  values,  <l>{x)  is  divisible  by  x  —  a.  A  neces- 
sary and  sufficient  condition  that  it  be  also  divisible  by  {x  —  of  is 
that  <i>\ci)  =  0,  accents  here,  and  in  what  follows,  denoting  differentia- 
tion. We  shall  see  in  a  moment  that  this  condition  can  be  imposed 
in  one,  and  only  one,  way  by  a  suitable  choice  of  A^  The  condi- 
tion that  <^{x)  be  divisible  by  {x  —  of  is  then  simply  <i>'\a)  =  0.  We 
wish  to  show  that  this  process  can  be  continued  until  we  have  finally 
imposed  the  condition  that  <^{x)hQ  divisible  by  (a;  —  a)".  For  this 
purpose  we  use  the  method  of  mathematical  induction,  and  assume 
that  Aq^  •••  As_-^  have  been  so  determined  that  (^{a)  =  <^\a)=  ••• 
=  <^f*-iJ(a)  =  0.  It  remains  then  merely  to  show  that  Ag  can  be  so 
determined  that  (f>^\d)  =  0.     For  this  purpose  we  notice  that 

(4)  ^^\x)  =  2A^\x)A{x)^^\rlxyf  +  Rlx) 

where  i^X^)  i^  ^^  integral  rational  function  with  numerical  coeffi- 
cients of  ^i,  i/rj,    ...  i/rW,  J.,  J.^  ...  J.[*-il     Since 

A{a)  =  A,,  A'{a)^  A^,  A''{a)^2lA^,  ...  A^^-^a)=:{s-l)\A,_^, 

it  follows  that  R{a)  is  a  known  constant,  that  is,  that  it  does  not 
depend  on  any  of  the  still  undetermined  constants  Ag^  A^+i^  •  •  •  Aa_^^ 


PAIRS   OF   QUADRATIC    FORMS  299 

nor  on  the  ^'s,  Cs,  etc.  Consequently  we  see  from  (4)  that  a  neces- 
sary and  sufficient  condition  that  </>f*^(a)  =  0  is  that  A^  have  the  value 

(5)  A= ~  ^•^(^)      . 

Determining  the  coefficients  J.^  A^^  •••  A^_^  in  succession  by  means 
of  this  formula,  we  finally  determine  the  polynomial  A{x)  in  such  a 
way  that  <^(a;)  is  divisible  by  {x  —  ay.  For  this  determination, 
ix^^yf  ~  ^  ^^^f  as  we  saw  above,  be  divisible  by  {x  —  ay. 

In  the  same  way  we  can  now  determine  the  coefficients  of  B(x)  so 
that  (x(^)f  —  a;  is  divisible  by  (x  —  by;  then  we  determine  the  coeffi- 
cients of  C{x)  so  that  (x(^)f  —  a:^  is  divisible  by  (x—  cy;  etc.  When 
all  the  polynomials  A^  B,  C^  •  •  •  are  thus  determined,  (X{x)y  —  a;  is 
divisible  by  '^{x)^  and  our  lemma  is  proved. 

Theorem  2.  If  a.  is  a  non-singular- matrix  of  order  n,  there  exist  ma- 
trices b  of  order  n  {necessarily/  non-singular)  with  the  following  properties : 

b2  =  a, 

b  is  a  polynomial  in  a  of  degree  less  than  n. 

Since  a  is  non-singular,  its  characteristic  function  4>(\)  is  a  poly- 
nomial of  the  nth  degree  whose  constant  term  is  not  zero.  Hence,  by 
the  preceding  lemma,  a  polynomial  %(X)  of  degree  less  than  n  can  be 
determined  such  that     ^^^^^y  _^_  ^(x)/(A:) 

where /(X)  is  also  a  polynomial.     From  this  identity  it  follows  that 

(X(a))2-a  =  <^(a)/(a). 

Since,  by  Theorem  1,  <^(a)  =  0,  tlie  last  equation  may  be  written 

(X(a)f  =  a,      _ 

so  that  b  =  %(a)  is  a  matrix  satisfying  the  conditions  of  our  theorem, 
which  is  thus  proved. 

102.  Symmetric  Matrices.  The  application  of  the  theory  of  ele- 
mentary divisors  to  the  subject  of  quadratic  forms  rests  on  the  fol- 
lowing proposition: 

Theorem  1.  If  sli  and  a.^  are  symmetric  matrices  and  if  there 
exist  two  non-singular  matrices  p  and  q  such  that 

(1)  a2  =  paiq. 
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then  there  also  exists  a  non- singular  matrix  P  such  that 

(2)  a2  =  P'aiP 

where  P'  is  the  conjugate  of  P.* 

Let  us  denote  by  p'  and  q'  the  conjugates  of  p  and  q  respectively. 
Taking  the  conjugates  of  both  sides  of  (1),  and  remembering  that 
a^  and  sl^,  being  symmetric,  are  their  own  conjugates,  we  get,  by 
Theorem  6,  §  22, 

(3)  a2  =  q'alP^ 

By  equating  the  values  of  a^  in  (1)  and  (3),  we  readily  deduce  the 
further  relation 

(4)  (q'rpai  =  aip'q-^ 

For  brevity  we  will  let 

(5)  U  =  (q'r'p,  U'=p'q->, 

and  note  that  U'  is  the  conjugate  of  U  ;  cf.  Exercise  6,  §  25.     Equa- 
tion (4)  may  then  be  written 

(6)  Uai  =  alU^ 

From  this  equation  we  infer  at  once  the  following  further  ones  : 

U^i  =  Ua^U'  =  aiU'2, 
U%i  =  UaiU'2=aiU'^ 


(7) 


,k-i 


U*ai  =  UaiU'    '  =  ajU'  . 


Let  us  now  multiply  the  equations  (6)  and  (7)  and  also  the  equa- 
tion a^  =  a^  by  any  set  of  scalar  constants  and  add  them  together. 
We  see  in  this  way  that  if  %  (U)  is  any  polynomial  in  U, 

(8)  x(U)a,  =  aix(U'). 

*  A  proof  of  this  theorem  much  simpler  than  that  given  in  the  text  is  the  following : 
From  (1)  we  infer  at  once  that  ai  and  a^  have  the  same  rank.  Hence  the  quad- 
ratic forms  of  which  ai  and  &2  are  the  matrices  are  equivalent  to  each  other  by  Theorem 
4,  §  46.  If  we  denote  by  P  the  matrix  of  the  linear  transformation  which  carries  over 
the  quadratic  form  ai  into  the  form  a2,  we  see,  from  Theorem  1,  §  43,  that  equation  (2) 
holds. 

This  proof  would  not  enable  us  to  infer  that  P  can  be  expressed  in  terms  of  p  and  q 
alone,  and  this  is  essential  for  our  purposes. 
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We  will  choose  the  polynomial 

V  =  x(U) 
so  that  V  is  non-singular  and 

as  is  seen  to  be  possible  by  Theorem  2,  §  101.     Denoting  by  V'  the 
conjugate  of  V,  we  evidently  have 

so  that  we  may  write  (8)  in  the  form 

Vai  =  alV^ 
or  al  =  V-lalV^ 

We  now  substitute  this  value  in  (1)  and  get 
(9)  a2  =  pV-iaiV^q. 

From  the  first  equation  (5)  we  infer  the  formula 

pV-i=q'V. 

Consequently  pV~i  is  the  conjugate  of  V'q,  so  that  if  we  let 

P  =  V^q, 

equation  (9)  may  be  written 

a2  =  P'aiP, 
and  our  theorem  is  proved. 

The  proof  just  given  enables  us  to  add  the 

Corollary.  As  the  matrix  P  of  the  foregoing  theorem  may  he 
taken  the  matrix  V'q  where  V  is  the  conjugate  of  any  one  of  the  square 
roots^  determined  by  Theorem  2,  §  101,  of  (q')~^  p. 

In  particular  it  will  be  seen  that  P  depends  on  p  and  q  but  not  on 
a^  or  ag.  Hence  if  a^,  ag,  b^,  \  are  symmetric  matrices,  and  there 
exist  two  non-singular  matrices  p  and  q  such  that 

^2  =  paiq,  bg  =  pb^q, 

then  there  exists  a  non-singular  matrix  P  such  that 

a2  =  P'aiP'  b2  =  P'biP. 


302  INTRODUCTION   TO   HIGHER   ALGEBRA 

From  this  and  Theorem  2,  §  96,  we  infer 

Theorem  2.  If  a^,  ag,  bj,  bg,  are  symmetric  matrices  of  which  b^,  bg 
are  non-singular,  a  necessary  and  sufficient  condition  that  a  non-singu- 
lar matrix  P  exist  such  that 

(10)  a2  =  P'aiP,  b2  =  P'biP, 

where  P'  is  the  conjugate  of  P,  is  that  the  matrices 

a^  —  Xbj,  a2  —  Xb2 

have  the  same  invariant  factors,  —  or,  if  we  prefer,  the  same  elementary 
divisors. 

If,  in  particular,  b^  =  b2  =  I,  where  I  is  the  unit  matrix,  we  have, 
from  the  second  equation  (10),  the  formula 

I=P'P. 

Such  a  matrix  P  we  call  an  orthogonal  matrix  according  to  the  defini- 
tion, which  will  readily  be  seen  to  be  equivalent  to  the  one  given  in 
the  first  footnote  on  page  154  : 

Definition.  By  an  orthogonal  matrix  we  understand  a  non-singu- 
lar matrix  whose  inverse  is  equal  to  its  conjugate. 

In  the  special  case  just  referred  to.  Theorem  2  may  be  stated  in  the 
following  form: 

Theorem  3.  If  a^  and  a^  are  two  symmetric  matrices,  a  necessary 
and  sufficient  condition  that  an  orthogonal  matrix  P  exist  such  that 

a2=P'aiP 

is  that  the  characteristic  matrices  of  a^  and  0.2  have  the  same  invariant 
factors,  —  or,  if  we  prefer,  the  same  elementary  divisors. 

If  this  theorem  is  compared  with  Theorem  3,  §  96,  it  will  be  seen 
that  it  differs  from  it  only  in  two  respects,  first  that  a^  and  a2  are 
assumed  to  be  symmetric,  and  secondly  that  P  is  required  to  be 
orthogonal. 

103.  The  Equivalence  of  Pairs  of  Quadratic  Forms.  Let  us  con- 
sider the  two  pairs  of  quadratic  forms 

n  n 

<^i  =  2  alj  Xi  x;  '^1  =  ^  H  ^i  ^P 

1  1 

n  n 

and  <^2  =  ^  a'ijXiXj^  '^2  =  ^ K^i^j-) 
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of  which  the  two  forms  sjr-^  and  yjr^  are  assumed  to  be  non-singular. 
We  will  inquire  under  what  conditions  these  two  pairs  of  forms  are 
equivalent ;   that  is,  under  what  conditions  a  linear  transformation 


•^n  —  ^711*^1      I        "*       I     ^nrv^v. 


exists  which  carries  over  c^^  into  ^^  and,  at  the  same  time,  -^/r.^  into  •^^. 
If  we  denote  the  conjugate  of  the  matrix  c  by  c^  and  the  ma- 
trices of  the  forms  (^j,  -i/r^,  ^^^  -^^  by  a^  b^,  ag,  bg  respectively,  we 
know,  by  Theorem  1,  §  43,  that  the  transformation  c  carries  over  </>j 
and  i/r-^  into  forms  with  the  matrices 

c^a^c,  c'b^c 

respectively;  so  that,  if  these  are  the  forms  ^^  and  i/rg,  we  have 
(1)  «2  =  c'ajC,  b2  =  c^bjC. 

Consequently,  by  Theorem  2,  §  102,  the  two  X-matrices 

a^  —  Xbj,  a2  —  Xb2 

have  the  same  invariant  factors  and  elementary  divisors. 

Conversely,  by  the  same  theorem,  if  these  two  X-matrices  have 
the  same  invariant  factors  (or  elementary  divisors),  a  matrix  c,  inde- 
pendent of  X,  exists  which  satisfies  both  equations  (1)  ;  and  hence 
the  two  pairs  of  quadratic  forms  are  equivalent.  Thus  we  have 
proved 

Theorem  1.  If  <^j,  -i/r^  and  (f)^,  '^^  aye  two  pairs  of  quadratic 
forms  in  n  variables^  in  which  -v/r^  and  yjr^  are  non-singular,  a  necessary 
and  sufficient  condition  that  these  two  pairs  of  forms  he  equivalent  is 
that  the  matrices  of  the  two  pencils 

have  the  same  invariant  factors,  —  or,  if  we  prefer,  the  same  elemen- 
tary divisors,* 

A  special  case  of  this  theorem  which  is  of  considerable  impor- 
tance is  that  in  which  both  of  the  forms  i/r^  and  y^r^  reduce  to 

*For  brevity,  we  shall  speak  of  these  invariant  factors  and  elementary  divisors  as 
the  invariant  factors  and  elementary  divisors  of  the  pairs  of  forms  0i,  \p\  and  02?  i^z 
respectively. 
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In  this  case  we  have  to  deal  with  orthogonal  transformations  (of.  the 
Definition  in  Exercise  1,  §  52),  and  our  theorem  may  be  stated  in 
the  form  * 

Theorem  2.  If  ^i^  and  ag  aye  the  matrices  of  two  quadratic  forms^ 
a  necessary  and  sufficient  condition  that  there  exist  an  orthogonal  trans- 
formation which  carries  over  one  of  these  forms  into  the  other  is  that  the 
characteristic  matrices  of  a^  and  a.^  have  the  same  invariant  factors^ 
—  or,  if  we  prefer^  the  same  elementary  divisors. 

To  illustrate  the  meaning  of  the  theorems  of  this  section,  let  us 
consider  again  briefly  the  problem  of  the  simultaneous  reduction  of 
two  quadratic  forms  to  sums  of  squares.  In  Chapter  XIII  we  be- 
came acquainted  with  two  cases  in  which  this  reduction  is  possible ; 
cf .  Theorem  2,  §  58,  and  Theorem  2,  §  59.  We  are  in  a  position  now 
to  state  a  necessary  and  sufficient  condition  for  the  possibility  of 
this  reduction,  provided  that  one  of  the  two  forms  is  non-singular. 

For  this  purpose,  consider  the  two  quadratic  forms 


-y^  =  c^x\  +  c^x\  -\- 


"t~  l^vP^n') 


where  we  assume,  in  order  that  the  second  form  may  be  non-singular, 
that  none  of  the  c^  vanish.     The  matrix  of  the  pencil  (/>  —  \^  is 


\ 


(?jX 


0 


0 

^2  ~  ^2^ 


0 


0 


0 


K  -  c^ 


and  the  elementary  divisors  of  this  matrix  are 


X-^, 


Cc^ 


c„ 


all  of  the  first  degree.  Consequently,  any  pair  of  quadratic  forms 
equivalent  to  the  pair  just  considered  must  have  a  X-matrix  whose 
elementary  divisors  are  all  of  the  first  degree. 

Conversely,  if  we  have  a  pair  of  quadratic  forms,  of  which  the 
first  is  non-singular,  whose  X-raatrix  has  elementary  divisors  all  of 

*  This  theorem  is,  of  course,  essentially  equivalent  to  Theorem  3,  §  102,  of  which  it 
may  be  regarded  as  an  immediate  consequence. 
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the  first  degree,  we  can  obviously  choose  the  constants  k  and  c 
in  such  a  way  that  the  X-matrix  of  the  forms  <^  and  ^^r  just  con- 
sidered has  these  same  elementary  divisors,  and  therefore  the  given 
forms  are  equivalent  to  these  special  forms  (\>  and  '^.  Thus  we  have 
proved  the  theorem  : 

Theorem  3.  If  (i>  and  -xjr  are  quadratic  forms  and  yfr  is  non-singu- 
lar^ a  necessary/  and  sufficient  condition  that  it  he  possible  to  reduce  <f> 
and  yfr  simultaneously  hy  a  non- singular  linear  transformation  to  forms 
into  which  only  the  square  terms  enter  is  that  all  the  elementary  divisors 
of  the  pair  of  forms  he  of  the  first  degree. 

This  theorem  obviously  includes  as  a  special  case  Theorem  2  of 
§  58,  since  the  elementary  divisors  are  necessarily  of  the  first  degree 
when  the  \-equation  has  no  multiple  roots. 

Comparing  the  theorem  just  proved  with  Theorem  2,  §  59,  we  see 
that  under  the  conditions  of  that  theorem  the  elementary  divisors 
must  be  of  the  first  degree.     Hence 

Theorem  4.  If  "^  is  a  non-singular.,  definite^  quadratic  form.,  and 
(j)  is  a  real  quadratic  form,  all  the  elementary  divisors  of  this  pair  of 
forms  are  necessarily  of  the  first  degree. 

104.  Classification  of  Pairs  of  Quadratic  Forms.  We  consider 
the  pair  of  quadratic  forms 

n  n 

(1)  ,  (^  =  ^aijXiXj,  ylr  =  l.hij.XiXj,. 

and  assume,  as  before,  that  yjr  is  non -singular.  We  denote  the  ele- 
mentary divisors  of  these  forms,  as  in  §  99,  by 

(X  -  X^)%         (\  -  \^\  ...  (X  -  X,)^*         (g^  +  ^2  +  •  •  •  +  ^*  =  ^)- 

The  symbol  \e^  e^  •••^a]  we  call  the  characteristic  of  the  pair  of 
quadratic  forms;  and  all  pairs  of  quadratic  forms  which  have  the 
same  characteristic  we  speak  of  as  forming  a  category.* 

We  have  here,  precisely  as  in  the  case  of  bilinear  forms,  the 
theorem  : 

Theorem.  If  X^,  Xg,  •  •  •  X;^  are  any  constants.,  equal  or  unequal., 
and  e^.,  e^.,  ...  ej^  are  any  positive  i7itegers  whose  sum  is  n,  there  exist  pairs 

*  Thus,  for  instance,  all  pairs  of  forms  of  which  the  second  is  non-singular  and  which 
admit  of  simultaneous  reduction  to  sums  of  squares,  form  a  category  whose  character- 
istic is  [1  1  •••  1].     Cf.  Theorem  3,  §  103. 

X 
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of  quadratic  forms  in  n  variables^  the  second  form  in  each  pair  being 
non-singular^  which  have  the  elementary  divisors 

(2)  (X-Xi)%         (X-\2)%    i^-M'- 

The  proof  of  this  theorem  consists  in  considering  the  following 
pair  of  quadratic  forms,  analogous  to  the  normal  form  (3')  of  §  99: 


(3) 


^1+1  e^+l 

n— 1  n-\ 

+  -  +1' 

1  e^+l  ^,+^2+1 


n-e,.+l  n-ei.+l  ' 


where  (?j,  •  •  •  ^^,  c?j,  •  •  •  cZ^t  are  constants  which  may  be  chosen  at  pleas- 
ure, provided  none  of  them  are  zero. 

The  X-matrix  of  this  pair  of  forms  is  the  same  as  the  X-matrix  of 
the  pair  of  bilinear  forms  (3^)  of  §  99,  and  therefore  has  the  desired 
elementary  divisors. 

A  reference  to  Theorem  1,  §  103,  shows  that  formula  (3)  yields  a 
normal  form  to  which  every  pair  of  quadratic  forms,  of  which  the 
second  is  non-singular  and  whose  elementary  divisors  are  given  by 
(2),  can  be  reduced. 

The  categories,  of  which  we  have  so  far  spoken,  may  be  divided 
into  classes  by  the  same  methods  we  used  in  §  99  in  the  case  of 
bilinear  forms.  This  may  be  done,  as  before,  either  by  simply  noting 
which  of  the  X/s  are  equal  to  each  other,  or  by  further  distinguishing 
between  the  cases  where  some  of  the  X/s  are  zero. 

We  are  now  in  a  position  to  see  exactly  in  what  way  our  elemen- 
tary divisors  give  us  a  more  powerful  instrument  than  we  had  in  the 
invariants  ©^  of  §  57.  These  invariants  @i,  being  the  coefficients  of 
the  X-equation  of  our  pair  of  forms,  determine  the  constants  X^,  which 
are  the  roots  of  this  equation,  as  well  as  the  multiplicities  of  these 
roots.  They  do  not  determine  the  degrees  e^  of  the  elementary  di- 
visors, and  the  use  of  the  ©/s  alone  does  not,  in  all  cases,  enable  us 
to  determine  whether  two  pairs  of  forms  are  equivalent  or  not. 
Thus,  for  instance,  we  may  have  two  pairs  of  forms  with  exactly  the 
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same  invariants  @f  but  with  characteristics  [(11)11  •••1]  and 
[211  •••  1]  respectively.*  It  will  be  seen,  therefore,  that  the  @/s 
form  in  only  a  very  technical  sense  a  complete  system  of  invariants. 

EXERCISES 

1.  Form  a  numerical  example  in  the  case  n  =  3  to  illustrate  the  statement 
made  in  the  next  to  the  last  sentence  of  this  section. 

2.  Prove  that  if  two  equivalent  pairs  of  quadratic  forms  have  two  elementary 
divisors  of  the  first  degree  which  correspond  to  the  same  linear  factor,  there  exist 
an  infinite  number  of  linear  transformations  w^hich  carry  over  one  pair  of  forms 
into  the  other. 

3.  Prove  the  general  theorem,  of  which  Exercise  2  is  a  special  case,  namely, 
that  if  two  equivalent  pairs  of  quadratic  forms  have  a  characteristic  in  which  one 
or  more  parentheses  appear,  there  exist  an  infinite  number  of  linear  transforma- 
tions which  carry  over  one  pair  of  forms  into  the  other. 

4.  Prove  that  if  two  equivalent  pairs  of  quadratic  forms  have  a  characteristic 
in  which  no  parentheses  appear,  only  a  finite  number  of  linear  transformations 
exist  which  carry  over  one  pair  of  forms  into  the  other,  f 

How  are  these  transformations  relrated  to  each  other? 

105.  Pairs  of  Quadratic  Equations,  and  Pencils  of  Forms  or  Equa- 
tions. J  In  dealing  with  quadratic  forms,  the  questions  of  equiva- 
lence and  classification  do  not  always  present  themselves  to  us  in 
precisely  the  form  in  which  we  have  considered  them  in  the  last  two 
sections.  We  frequently  have  to  deal  not  with  the  quadratic  forms 
themselves  but  with  the  equations  obtained  by  setting  the  forms 
equal  to  zero.  Two  such  pairs  of  equations  we  shall  regard  as 
equivalent,  not  merely  if  the  forms  in  them  are  equivalent,  but  also 
if  one  pair  of  forms  can  be  obtained  from  the  other  by  multiplication 
by  constants  different  from  zero. 

Let  us  consider  two  quadratic  forms  <^,  -v/r,  of  which  we  assume, 
as  before,  that  the  second  is  non-singular,  and  inquire  what  the 
effect  on  the  elementary  divisors 

(1)  (X  -  Xi)%      (X  -  \)% (X  -  x.y.    ^ 

*  We  may,  in  the  case  n  =  3,  put  the  same  thing  geometrically  (cf.  the  next  sec- 
tion) by  saying  that  it  is  impossible  to  distinguish  between  the  case  of  two  conies  having 
double  contact  and  that  of  two  conies  having  simple  contact  at  a  single  point  by  the  use 
of  the  invariants  0i  alone,  whereas  these  two  cases  are  at  once  distinguished  by  the  use 
of  elementary  divisors. 

t  The  exercise  in  §  58  is  practically  a  special  case  of  this. 

\  Questions  similar  to  those  treated  in  this  section  might  have  been  taken  up  in 
the  last  chapter  for  the  case  of  bilinear  forms. 
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of  these  forms  will  be  if  the  forms  are  multiplied  respectively  by  the 
constants  jt?,  q  which  are  both  assumed  to  be  different  from  zero. 
Let  us  write 

Then 

(2)  <^i-\ti^i>(^-^'t) 

where  x^—l\. 

P 

Let  X  —  a  be  any  one  of  the  linear  factors  of  the  matrix  of  </>  —  X-v/r, 
so  that  a  is  any  one  of  the  constants  X^,  Xg,  •••  X;;. ;  and  let  us  denote, 
as  in  the  footnote  to  Definition  3,  §  92,  by  Z^  the  exponent  of  the 
highest  power  of  X  —  a  which  is  a  factor  of  all  the  i-rowed  deter- 
minants of  this  matrix.  Then  it  is  clear,  from  (2),  that  l^  is  the 
exponent  of  the  highest  power  of  X'  —  a  which  is  a  factor  of  all  the 
^-rowed  determinants  of  the  matrix  of  </>j  —  X-i/r^.     In  other  words, 


9. 

is  the  highest  power  of  the  linear  factor  X  —  jpa/q  which  is  a  factor 
of  all  the  z-rowed  determinants  of  the  matrix  of  </)j  —  \'>^y  Turning 
now  to  the  definition  of  elementary  divisors  as  given  in  the  footnote 
to  Definition  3,  §  92,  we  see  that  the  elementary  divisors  of  the  matrix 
of  ^j  —  Xi/r^  differ  from  those  of  the  matrix  of  <^  —  X'\/r  only  in  having 
the  constants  X^-  replaced  by  the  constants  p\/q'  We  thus  have  the 
result : 

Theorem  1.  If  the  pair  of  quadratic  forms  <^,  i/r,  of  which  the 
second  is  assumed  to  be  non-singular,  has  the  elementary  divisors 

(X  -  \)%       (X  -  \)S        (X  -  X,> , 

and  if  p,  q  are  constants  different  from  zero,  then  the  pair  of  quadratic 
forms  p(f>,  qyjr  has  the  elementary  divisors 

(X  -  \[)%     (X  -  x^)%      (X  -  xiyic 

where  Xj  =  -X,- 

In  particular,  it  will  be  seen  that  these  two  pairs  of  forms  have 
the  same  characteristic,  even  when  the  conception  of  the  character- 
istic is  refined  not  merely  by  inserting  parentheses  but  also  by  the 
use  of  the  small  zeros. 
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The  theorem  just  proved  shows  that  pairs  of  homogeneous 
quadratic  equations,  of  which  the  second  equation  in  each  pair  is 
non-singular,  may  be  classified  by  the  use  of  their  characteristics 
precisely  as  was  done  in  the  last  section  for  pairs  of  quadratic  forms. 
We  proceed  to  illustrate  this  in  the  case  w  =  3,  where  we  may  con- 
sider that  we  have  to  deal  with  the  classification  of  pairs  of  conies 
in  a  plane,  one  of  the  conies  being  non-singular. 

We  have  here  three  categories  represented  by  the  following 
normal  forms :  * 

[  i/r  =     x\-{-  x\      —     a;|. 


I.  [1  1  1] 

II.  [2  1] 

III.  [3] 


<j)  =  2  \j  :r J  3^3  +  X  ja:|  -f-  2x^X2 
ylr=^       2x-^X^-\-  x\. 


We  next  subdivide  these  categories  into  classes,  and,  by  an  ex- 
amination of  the  normal  form  in  each  case,  we  are  enabled  at  once  to 
characterize  each  class  by  certain  projective  properties  which  it  has, 
and  which  are  shared  by  no  other  class,  f  Since  the  conic  -v/r  is  non- 
singular  in  all  cases,  this  fact  need  not  be  explicitly  stated, 

[111]      </)  and  -^  intersect  in  four  distinct  points. 

[(1 1)  1]  <^  and  i/r  have  double  contact. 

[(1 1 1)]   </)  and  -v/r  coincide. 

[2  1]         <f)  and  i/r  meet  in  three  distinct  points  at  one  o±  which 
they  touch. 

[(2  1)]      (f)  and  i/r  have  contact  of  the  third  order. 

[3]  </)  and  s^  have  contact  of  the  second  order. 

In  all  of  the  above  cases  <^,  as  well  as  i/r,  is  non-singular. 
In  the  next  five  cases,  </>  consists  of  a  pair  of  distinct  straight 
lines. 

*  We  assign  to  the  constants  Ci  and  ^<,  in  formula  (3)  of  the  last  section,  values  so 
chosen  that  the  loci  0  =  0,  ^  =  0  are  real  when  the  constants  Xj  are  real.  This  is,  of 
course,  not  essential,  since  we  are  not  concerned  with  questions  of  reality. 

t  In  order  to  verify  the  statements  made  below,  the  reader  should  have  some 
knowledge  of  the  theory  of  the  contact  of  conies;  cf.  for  instance  Salmon's  Conic 
Sections,  Chapter  XIV.,  pages  232-238. 
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0 

[111]      <l>  and  yjr  intersect  in  four  distinct  points. 

0 

[(1 1)1]   Both  of  the  lines  of  which  <^  consists  touch  sir. 

0 

[2  1]  One  of  the  lines  of  which  cj)  consists  touches  i/r,  while 
the  other  cuts  it  in  two  points  distinct  from  the  point 
of  contact  of  the  first. 

0 

[2  1]  The  two  lines  of  which  (j)  consists  intersect  on  yjr^  and 
neither  of  them  touches  yjr. 

0 

[3]  The  two  lines  of  which  cp  consists  intersect  on  yjr,  and 

one  of  them  touches  yjr. 

In  the  next  two  cases,  (j>  consists  of  a  single  line. 

0   0  _ 

[(1 1)  1  ]   The  line  <^  meets  n/r  in  two  distinct  points. 

0    0 

[(2  1)]      The  line  </>  touches  '^. 
Finally  we  have  the  case: 

0    0    0 

[(1 11])     Here  (^  =  0,  and  we  have  no  conic  other  than  yjr. 

Suppose  finally  that  we  wish  to  classify  not  pairs  of  quadratic  forms 
or  equations  but  pencils  of  quadratic  forms  or  equations.  Consider  the 
pencil  of  quadratic  forms  j^  _  ^^ 

where  (f)  and  yjr  are  quadratic  forms,  and  yjr  is  non-singular,  and 
suppose  that  the  elementary  divisors  of  the  pair  of  forms  c^,  yjr  are 
given  by  formula  (1)  above.  The  question  presents  itself  whether, 
if,  in  place  of  the  forms  </>,  -i/r,  we  take  any  other  two  forms  of  the 

the  constants  fi,  v  being  so  chosen  that  fJL^v  and  that  ^/r^  is  non- 
singular,  the  pair  of  forms  <^j,  -^y,  will  have  these  same  elementary 
divisors  (1).  If  this  were  the  case,  we  could  properly  speak  of  (1) 
as  the  elementary  divisors  of  the  pencil.  This,  however,  is  not  the 
case,  and  the  pencil  of  quadratic  forms  cannot  properly  he  said  to  have 
elementary  divisors.^ 

*  We  here  regard  the  pencil  as  merely  an  aggregate  of  an  infinite  number  of 
quadratic  forms,  namely,  all  the  forms  which  can  be  obtained  from  the  expression 
<p  —  \\p  hy  giving  to  \  different  values.  In  this  sense  we  cannot  speak  of  the  elementary 
divisors  of  the  pencil.  If,  however,  we  wish  to  regard  the  polynomial  in  the  x's  and  X, 
<f)  —  X;^,  as  the  pencil,  we  may  speak  of  its  elementary  divisors,  meaning  thereby  simply 
what  we  have  called  the  elementary  divisors  of  the  pair  of  forms  <p,  ■^. 
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There  is,  however,  a  simple  relation  between  the  elementary 
divisors  of  two  pairs  of  forms  taken  from  the  same  pencil.  In  order 
to  show  this,  let  us  determine  the  elementary  divisors  of  the  pair  of 
forms  <^j,  o/tj,  above.  For  this  purpose  consider  the  expression 
<^j  —  Xyjr^^  which,  when  X  =7^  1,  may  be  written 

(3)  <^^_X>/r,^(l-X)[c^-X'^] 

where  X'  =  ^~  ^    . 

J.    —    A, 

Now  suppose,  as  above,  that  \  —  a  is  any  one  of  the  linear  factors  of 
the  matrix  of  <^  —  X>/r,  and  that  Z^  is  the  exponent  of  the  highest 
power  oi  \  —  a  which  is  a  factor  of  all  the  i-rowed  determinants  of 
this  matrix.  Then  any  one  of  the  ^-rowed  determinants  of  the  ma- 
trix of  </>  —  X'yjr  may,  when  X  =5^  1,  be  written  in  the  form 

(X'  -  «)y(\') 

where  /  is  a  polynomial  in  X'  of  degree  not  greater  than  i  —  l^. 
Accordingly,  by  (3),  the  corresponding  ^-rowed  determinant  of  the 
matrix  of  <^j  —  X-^^  may  be  written 

[;^_^X-a(l-X)]^*/i(X) 

where /j  is  a  polynomial  in  X.     Thus  we  see  that 

a  —  V 

is  a  factor  of  every  i-rowed  determinant  of  the  matrix  of  <^j  —  Xi/r^. 
Similar  reasoning,  carried  through  in  the  reverse  order,  shows  that 
this  is  the  highest  power  of  a  —  /x 

a—v 
which  is  a  factor  of  all  these  z-rowed  determinants.     Hence 

Theorem  2.  If  the  pair  of  quadratic  forms  </>,  ^/r,  of  which  the 
second  is  non-singular,  have  the  elementary  divisors 

(X  -  Xi)%         (X  -  X2)%     (X  -  \j,)% 

and  if  /jl,  V  are  any  two  constants  distinct  from  each  other  and  such 
that  V  is  distinct  from  all  the  constants  X^,  X2^  •••  Xj^.,  then  the  two  forms 

of  which  the  second  will  then  he  non-singular,  will  have  the  elementary 
divisors  (X-Xi)%         (X-X^)%      (^-Ky 

where  \[  =  ^^^^^^  (^  =  1,2,...  h\ 
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In  particular,  it  will  be  seen  that  the  two  pairs  of  forms  <^,  -yjr  and 
<^j,  'i^j  have  the  same  characteristic  [gj  ^2  '"  ^J  ^^^^  if  we  put  in 
parentheses  to  indicate  which  of  the  e's  correspond  to  equal  X/s. 
The  characteristics  will  not,  however,  necessarily  be  the  same  if  we 
put  in  small  zeros  to  indicate  which  of  the  es  correspond  to  vanish- 
ing Xj's,  since  Xj  and  Xj  do  not  usually  vanish  together.  Accord- 
ingly, in  classifying  pencils  of  quadratic  forms,  we  may  use  the 
characteristic  of  any  pair  of  distinct  forms  of  the  pencil,  the  second 
of  which  is  non-singular,  but  we  must  not  introduce  the  small  zeros 
into  these  characteristics.  This  classification,  of  course,  applies  only 
to  what  may  be  called  non-singular  pencils^  that  is,  pencils  whose 
forms  are  not  all  singular. 

It  will  readily  be  seen  that  what  has  just  been  said  applies  with- 
out essential  change  to  the  case  of  pencils  of  homogeneous  quadratic 
equations.  We  may  therefore  illustrate  it  by  the  classification  of 
non-singular  pencils  of  conies.*  We  have  here  six  classes  of  pencils 
which  we  characterize  as  follows : 

[1 1 1]        The  conies  all  pass  through  four  distinct  points. 

[(1 1)  1]     The  conies  all  pass  through  two  points  at  which  they 

have  double  contact  with  each  other. 
[(1 1 1)]     The  conies  all  coincide. 
[2  1]  The    conies  all  pass  through   three  points  at   one   of 

which  they  touch  one  another. 
[(21)]         The  conies  all  pass  through  one  point  at  which  they 

have  contact  of  the  third  order. 
[3]  The   conies  all  pass  through   two  points,  at   one   of 

which  they  have  contact  of  the  second  order. 

EXERCISES 

1.  Determine,  by  the  use  of  elementary  divisors,  the  nature  of  each  of  the 
following  pairs  of  conies : 

3  xf  +  7  a:i  +  8  0:1  X2  —  10  2:2  2:3  +  4  rri  0:3  =  0 

2  x1  -{-  3  xl  —  Xs  -\-  4:  X1X2  —    Q  X2X3  +  Q  xixs  =  0. 

icl  —  xl  —  S  xl  —  ^  X1X2  +  ^  X2X3  -\-  xi^Js  =  0 
'?  +  ^2  ~     xl  —  2xiX2  —  2x2X3-\-2xiXs=  0. 

2.  Give  a  classification  of  pairs  of  binary  quadratic  equations,  the  second 
equation  of  each  pair  being  non-singular,  and  interpret  the  work  geometrically . 


(a){ 
^^    [2x1 


*  For  a  similar  classification  of  pencils  of  quadrics  we  refer  to  p.  46  of  Bromwich's 
book  :  Quadratic  Forms  and  their  Classification  by  Means  of  Invariant  Factors. 
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106.  Conclusion.  We  wish,  in  this  section,  to  point  out  some 
of  the  important  questions  connected  with  the  subject  of  elementary 
divisors,  which,  in  order  to  keep  our  treatment  within  proper  limits, 
we  have  been  obliged  to  leave  out  of  consideration. 

If  (^j,  yjr^  and  (/>2,  yjr^  are  two  pairs  of  bilinear  or  quadratic 
forms  of  which  ^|r-^^^  -^jr^  are  non-singular,  we  have  found  a  method 
of  determining  whether  these  two  pairs  of  forms  are  equiva- 
lent or  not.  If  we  use  the  invariant  factors  instead  of  the  ele- 
mentary divisors,  our  method  involves  only  the  use  of  the  rational 
operations  (addition,  subtraction,  multiplication,  and  division), 
and  can,  therefore,  be  actually  carried  through  in  any  concrete 
case.  In  fact  we  have  explained  in  §  93  some  really  practical 
methods  of  determining  the  invariant  factors  of  a  X-matrix,  so 
that  the  problem  of  determining  whether  or  not  two  pairs  of 
bilinear  or  quadratic  forms,  the  second  form  in  each  pair  being 
non-singular,  are  equivalent,  may  be  regarded  as  solved,  not 
merely  from  the  theoretical,^  but  also  from  the  practical  point  of 
view. 

There  is,  however,  another  question  here,  which  we  have  not 
treated,  namely,  if  the  two  pairs  of  forms  turn  out  to  be  equiva- 
lent, to  find  a  linear  transformation  which  carries  over  one  into 
the  other.  This  problem,  too,  we  may  consider  that  we  have 
solved  from  a  theoretical  point  of  view;  for  the  proof  we  have 
given  that  if  two  pairs  of  forms  have  the  same  elementary 
divisors  there  exists  a  linear  transformation  which  carries  over 
one  pair  of  forms  into  the  other,  consisted,  as  will  be  seen  on 
examination,  in  actually  giving  a  method  whereby  such  a  linear 
transformation  could  be  determined.  In  fact,  in  the  case  of  bilinear 
forms,  the  processes  involved  are,  here  again,  merely  the  rational 
processes ;  so  that,  given  two  equivalent  pairs  of  bilinear  forms,  the 
second  form  of  each  pair  being  non-singular,  we  are  in  a  position  to 
find,  in  any  concrete  case,  linear  transformations  of  the  x's  and  ^/'s 
which  carry  over  one  pair  of  forms  into  the  other.  Even  here  the 
arrangement  of  the  work  in  a  practical  manner  might  require 
further  consideration. 

In  the  case  of  quadratic  forms  the  problem  becomes  a  much  more 
difficult  one,  inasmuch  as  the  processes  involved  in  the  determination 
of  the  required  linear  transformation  are  no  longer  rational ;  cf.  the 
Lemma  of  §  101.     That  this  is  not  merely  a  defect  of  the  method  we 
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have  used,  but  is  inherent  in  the  problem  itself,  will  be  seen  by  a  con- 
sideration of  simple  numerical  examples.     Let,  for  instance, 

rl—    •^i  "I"     *^2'  r  2  —     *^1  •^2* 

Here  the  pairs  of  forms  ^j,  y\r^  and  (/)2,  -v/^g  both  have  the  elementary 

'^^^^^"'"^  X  -  2,  X  -  3, 

and  are  therefore  equivalent.  The  linear  transformation  which 
carries  over  one  pair  of  forms  into  the  other  cannot,  however,  be 
real  (and  therefore  its  coefficients  cannot  be  determined  rationally 
from  the  coefficients  of  the  given  forms)  since  ^^  and  i/r^  are  definite, 
<^2  and  -^2  indefinite. 

We  have,  therefore,  here  the  problem  of  devising  a  practical 
method  of  determining  a  linear  transformation  which  carries  over  a 
first  pair  of  quadratic  forms  into  a  second  given  equivalent  pair.  A 
method  of  this  sort,  which  is  a  practical  one  when  once  the  elemen- 
tary divisors  have  been  determined,  will  be  found  in  Bromwich's 
book  on  quadratic  forms  referred  to  in  the  footnote  on  p.  312. 

Another  point  at  which  our  treatment  is  incomplete  is  in  the 
restriction  we  have  always  made  in  assuming  that,  in  the  pair  of 
bilinear  or  quadratic  forms  </>,  -v/r,  the  form  i|r  is  non-singular.  Al- 
though this  is  the  case  in  many  of  the  most  important  problems  to 
which  one  wishes  to  apply  the  method  of  elementary  divisors,  it  is  still 
a  restriction  which  it  is  desirable  to  remove.  This  may  be  done  in 
part  by  making  use  not,  as  we  have  done,  of  the  pencil  <^  — Xi/r,  but 
of  the  more  general  pencil  /i</)  —  X-i/r,  /x  and  X  being  variable  param- 
eters. The  determinants  of  the  matrix  of  this  pencil  are  binary 
forms  in  (/x,  X),  and  the  whole  subject  of  elementary  divisors  admits 
an  easy  extension  to  this  case,  the  elementary  divisors  being  now 
integral  powers  of  linear  binary  forms.  The  only  case  which  can- 
not be  treated  in  this  way  is  that  in  which  not  only  <^  and  i/r  are 
both  singular,  but  every  form  of  the.  pencil  /xc/)  —  X-v/r  is  singular. 
This  singular  (?asg,  which  was  explicitly  excluded  by  Weierstrass  in 
his  original  paper,  requires  a  special  treatment  which  has  been  given 
by  Kronecker.  Cf.,  for  the  case  of  quadratic  forms,  the  book  of 
Bromwich  already  referred  to. 

Still  another  question  is  the  application  of  the  method  of  ele- 
mentary divisors  to  the  case  in  which  the  two  forms  <^,  i/r  are  real, 
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and  only  real  linear  transformations  are  admitted.  In  the  case  of 
bilinear  forms,  this  question  presents  no  serious  difficulty;  cf.  the 
exercises  of  §§  97,  99.  In  the  case  of  quadratic  forms,  however,  the 
irrational  processes  involved  in  the  proof  of  the  Lemma  of  §  101 
introduce  an  essential  difficulty,  since  they  are  capable  of  introduc- 
ing imaginary  quantities.  Moreover,  this  difficulty  does  not  lie 
merely  in  the  method  of  treatment.  The  theorems  themselves 
which  we  have  established  do  not  remain  true,  as  is  seen  by  a  refer- 
ence to  the  numerical  example  given  earlier  in  this  section  for  an- 
other purpose,  where  we  have  two  pairs  of  real  quadratic  forms 
which,  although  they  have  the  same  elementary  divisors,  are  not 
equivalent  with  regard  to  real  linear  transformations. 

We  must  content  ourselves  with  merely  mentioning  this  impor- 
tant subject,  and  referring,  for  one  of  the  fundamental  theorems,  to 
p.  69  of  the  book  of  Bromwich. 

For  further  information  concerning  the  subject  of  elementary 
divisors  the  reader  is  referred  to  Muth's  Theorie  und  Anwendung  der 
^lementartheiler^  Leipzig,  Teubner,  1899.  In  English,  the  book  of 
Bromwich  already  referred  to  and  some  sections  in  Mathews'  revision 
of  Scott's  Determinants  will  be  found  useful. 
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product  of  two,  26; 

rank  of,  22 ; 

skew-symmetric,  59; 

symmetric,  56. 
Discriminant  of  a  binary  biquadratic,  260; 

of  a  binary  cubic,  239; 

of  a  binary  form,  237 ,  259 ; 

of  a  polynomial  in  one  variable,  250; 

of  a  quadratic  form,  128 ; 

of  a  quadric  surface,  118. 
Division  of  polynomials,  180; 

of  \-matrices,  278 ; 

of  matrices,  75. 
Divisor  of  zero,  65,  80. 
Domain  of  rationality,  175,  212,  216. 
Dyad  (dyadic  polynomial),  79. 
Dyalitic  method  of  elimination,  199. 


Element  of  a  determinant  or  matrix,  20 ; 

of  a  set,  system,  or  group,  81. 
Elementary  divisors  of  a  X-matrix,  271 ; 

of  a  coUineation,  286; 

of  a  pair  of  bilinear  forms,  284 ; 

of  a  pair  of  quadratic  forms,  303 ; 

simple,  composite,  270. 
Elementary  symmetric  function,  242,  253. 
Elementary  transformation  of  a  matrix,  55 ; 

of  a  X-matrix,  262. 
Elimination,  198,  217,  238. 
Equations,  linear,  43 ; 

homogeneous,  linear,  47 ; 

quadratic,  149; 

quadratic,  pairs  of,  307; 

quadratic,  pencils  of,  312. 
Equianharmouic  points,  107. 
Equivalence,  92. 
Equivalent  matrices,  55,  93; 

coUineations,  286; 

X-matrices,  263,  274 ; 

pairs  of  bilinear  forms,  283 ; 

pairs  of  matrices,  281 ; 

pairs  of  quadratic  forms,  170,  303; 

quadratic  forms,  135,  148. 
Euclid's  algorithm,  189,  192,  206. 
Euler's  theorem  for  homogeneous  functions, 
237. 


Factors  of  a  polynomial,  174,  187,  203;  see 

invariant  factors. 
Fixed  points  of  a  collineation,  285 ; 

lines,  planes  of  a  collineation,  286. 
Forms,   4;   see  bilinear,  quadratic^  binary 
forms ; 
biquadratic,  260; 
cubic,  239; 
polar,  127. 


Fours  group,  87. 
Fractional  matrices,  86. 
Frobenius,  262,  270. 
Fundamental  system  of  solutions,  49; 
theorem  of  algebra,  16. 

Generator  of  a  quadric  surface,  119. 

Gibbs,  79. 

Greatest  common  divisor  of  integers,  188 ; 

of  polynomials  in  one  variable,  191,  197; 

of  polynomials  in  two  variables,  206. 
Ground-form,  96. 
Group,  80; 

Abelian  or  commutative,  83; 

cyclic,  87 ; 

fours  group,  87 ; 

isomorphic,  83 ; 

sub-,  83. 
Group  property,  82. 
\ 

Hamilton,  79,  296. 
Harmonic  division,  104. 
Homogeneity,  principle  of,  226. 
Homogeneous  co(»rdinates,  11 ; 

invariants,  230; 

linear  equations,  47,  49; 

polynomials,  4. 

Idemfactor,  74. 

Identical  vanishing  (equality)  of  polynomi- 
als, 2,5,7,  10; 

element  of  a  group,  82 ; 

transformation,  67. 
Indefinite  quadratic  form,  150. 
Index  of  inertia  of  quadratic  form,  146. 
Invariant,  absolute  algebraic,  89; 

arithmetical,  91, 94, 115, 124, 129, 146, 287 ; 

complete  system  of,  93 ; 

geometric,  88,  103; 

homogeneous,  230; 

integral   rational,   99,  101,  115,  129,  137, 
159,  16(5,  218,  259,  260; 

irrational,  167,  259,  260 ; 

rational,  96,  222;   see  also  integral,  ra- 
tional; 

relative  algebraic,  96,  115;   see  integral 
rational. 
Invariant  factors  of  a  X-matrix,  269 ; 

of  a  collineation,  286; 

of  a  pair  of  bilinear  forms,  284 ; 

of  a  pair  of  quadratic  forms,  303. 
Inverse  of  a  transformation,  67 ; 

of  an  element  of  a  group,  82 ; 

of  a  matrix,  75,  80; 

of  a  quadratic  form,  160. 
Isobaric  polynomial,  222; 

symmetric  function,  245,  255,  256. 
Isomorphic  groups,  83. 
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Jacobi,  144. 


Kronecker,  4,  139,  262,  314. 


X-equation  of  two  conies,  164 ; 

of  two  quadratic  forms,  166. 
X-matrix,  262. 
Lagrange's  reduction,  131. 
Laplace's  development,  26. 
Law  of  Inertia,  144 ; 

of  Nullity,  78,  80. 
Line  at  infinity,  13. 
Line-coordinates,  108,  110. 
Linear  dependence,  conditions  for,  36-38 ; 

of  geometric  configurations,  39; 

of  polynomials,  35,  38; 

of  sets  of  constants,  35,  48. 
Linear  equations,  43,  47,  49; 

transformations,  66. 
Linear  factors  of  polynomials  in  one  vari- 
able, 187 ; 

of  binary  forms,  188; 

of  X-matrices,  270. 


Matrix,  theory  of,  20-22,  54-66,  74-80,  86,  93, 
262-283,  296-302; 
adjoint  of,  77,  80 ; 
as  a  complex  quantity,  60; 
augmented,  44; 
conjugate,  21; 
determinant  of,  21; 
division  of  one  by  another,  75 ; 
elementary  transformation  of,  55 ; 
equivalent,  55 ; 
fractional,  86 ; 
inverse  of,  75 ; 

multiplication  by  matrix,  63; 
multiplication  by  scalar,  62 ; 
normal  form  of  a,  56  (Exercise  3) ; 
normal  form  of  a  symmetrical,  59; 
of  a  bilinear  form,  114 ; 
of  a  determinant,  21 ; 
of  a  quadratic  form,  128; 
of  a  quadric  surface,  118 ; 
of  a  system  of  linear  equations,  44 ; 
of  a  transformation,  66. 
orthogonal,  154,  302,  304; 
powers  of,  75 ; 
product  of  two,  63; 
rank  of,  22 ; 

rank  of  product  of  two,  77 ; 
scalar,  76 ; 
similar,  283; 
singular,  65 ; 
skew-symmetric,  59; 
sum  or  difference  of  two,  62 ; 


Matrix  —  Contimied. 

symmetric,  56 ; 

transposed,  21; 

unimodular,  83; 

unit,  74. 
Minors  of  a  determinant,  22 ; 

complementary,  22; 

corresponding,  31 ; 

principal,  23,  57-59. 
Mixed  concomitants,  109. 
Moore,  86. 

Multiplication  theorem,  28. 
Multiplicity  of  roots  of  an  equation,  18; 

of  pieces  of  curves  and  surfaces,  211,  214. 


Neighborhood  of  a  point,  8,  16,  214. 

Newton's  formulae,  244, 

Normal  form  ot  a  bilinear  form,  116; 

of  a  binary  biquadratic,  261 ; 

of  a  binary  cubic,  239; 

of  a  X-matrix,  267  ; 

of  a  matrix,  56  (Exercise  3)  ; 

of  a  pair  of  bilinear  forms,  289; 

of  a  pair  of  quadratic  forms,  169,  171,  306 ; 

of  a  quadratic  form,  135 ; 

of  a  quadric  surface,  124; 

of  a  real  quadratic  form,  148 ; 

of  a  symmetrical  matrix,  59. 
Nullity,  Sylvester's  Law  of,  78,  80. 
Null-system,  117, 


Order  of  a  determinant  or  matrix,  20 ; 

of  a  group,  87. 
Orthogonal  transformation,  matrix,  determi- 
nant, 154,  173,  ;i02,  304. 


Pencil  of  conies,  163,  312 ; 

of  bilinear  forms,  279: 

of  quadratic  forms,  165,  310. 
Period  of  an  element  of  a  group,  87. 
Plane  at  infinity,  13; 

conjugate,  158. 
Plane-coordinates,  107. 
Point  in  space  of  n  dimensions,  9; 

at  infinity,  12; 

conjugate,  121; 

equation  of  a,  107,  108 ; 

neighborhood  of  a,  8,  16,  214. 
Polar  plane,  122 ; 

form,  127 ; 

tetrahedron,  125. 
Pole,  124. 
Polynomial,  definition,  degree  of,  etc,  1,  4; 

continuity  of  a,  14; 

corresponding,  178; 

dyadic,  79; 
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in  a  matrix,  296 ; 

isobaric,  222 ; 

linear  dependence  of,  35 ; 

real,  5; 

roots  of  a,  18; 

symmetric,  240,  252. 
Prime  (relatively)  polynomials,  175. 
Principal  minors,  23,  57-59. 
Product,  degree  of,  1,  4  ; 

of  determinants,  26 ; 

of  matrices,  63,  277. 
Projective  transformation,  69; 

property,  88,  232. 
Pseudo-tangent  lines  and  planes,  120. 

Quadratic  forms,  127 ; 

adjoint  of,  159 ; 

definite  and  indefinite,  150 ; 

invariants  of,  129,  137,  146,  159,  165,  303; 

inverse  or  reciprocal  of,  160; 

law  of  inertia  of,  144 ; 

matrix,  discriminant,  rank  of,  128; 

normal  forms  of,  135,  148,  169,  171,  306 ; 

pairs  of,  165,  302 ; 

polar  of,  127 ; 

real,  144 ; 

reducibility  of,  136,  147; 

reduction  of,  to  sum  of  squares,  131,  139, 
167,  170,  173 ; 

regularly  arranged,  147; 

signature  of,  146 ; 

singular,  128; 

vertex  of,  129. 
Quadric  surface,  matrix,  discriminant,  rank 
of,  118; 

classification  of,  123,  149,  173 ; 

ruling  of,  119 ; 

singular,  118; 

tangent  to,  119,  120,  155. 
Quantic,  4; 
Quaternary  form,  5. 

Rank    of    a    matrix    or    determinant,    22, 
54-59; 

of  a  bilinear  form,  114; 

of  a  X-matrix,  262  ; 

of  a  quadratic  form ,  128 ; 

of  a  quadric  surface,  118 ; 

of  a  system  of  homogeneous  linear  equa- 
tions, 47 ; 

of  a  system  of  points  or  linear  forms, 
94; 

of  the  adjoint  of  a  quadratic  form,  161 ; 

of  the  product  of  two  matrices,  77. 
Rational   invariants,  96,   222;   see  also  in- 
variant, integral,  rational ; 

of  a  \-matrix,  270. 


Rational  relation,  244. 
Rationality,  domain  of,  175,  212,  216. 
Ray  coordinates,  113. 
Real  polynomials,  5,  174; 

matrix,  X-matrix,  elementary  transforma- 
tion, 278 ; 

quadratic  forms,  144-154,  161,  170-173. 
Reciprocal  or  inverse  of  a  quadratic  form, 

160. 
Reciprocation,  117. 
Reducibility  of  a  polynomial,  174 ; 

in  a  domain,  174,  175  ; 

of  bilinear  forms,  116; 

of  binary  forms,  188 ; 

of  determinants,  176; 

of  polynomials  in  one  variable,  187 ; 

of  quadratic  forms,  136,  147. 
Regularly  arranged  quadratic  form,  147. 
Resultant  of  linear  forms,  95 ; 

of  two  binary  forms,  201,  236,  239,  257; 

of  two  polynomials  in  one  variable,  195, 
239,  248. 
Roots  of  a  polynomial  or  equation,  18. 
Ruling  of  a  quadric  surface,  119. 

S-functions,  241,  253. 
S-functions,  240,  252. 
Scalar,  62 ; 

matrix,  76. 
Self-conjugate  tetrahedron,  125; 

triangle,  164. 
Serai-definite  quadratic  form,  150. 
Set  of  objects,  80. 
Sgn,  147. 

Signature  of  a  quadratic  form,  14G. 
Similar  matrices,  283. 
Simple  elementary  divisors,  270. 
Singular  matrix,  65 ; 

bilinear  form,  114; 

conic,  163,  272; 

linear  transformation,  67; 

quadratic  form,  128; 

quadric  surface,  118. 
Skew-symmetric  determinant,  59 ; 

bilinear  form,  117; 

matrix,  59. 
Smith,  H.  J.  S.,  262. 
Subgroup,  83. 
Subresultant,  197. 
Sylvester,  78,  144,  199,  262. 
Symbolic  product  of  bilinear  forms,  114. 
Symmetric    determinant    and    matrix,    56, 
299; 

bilinear  form ,  115 ; 

binary  function,  255 ; 

polynomial,  240; 

polynomial  in  pairs  of  variables,  252 ; 

ternary  function,  257. 
System,  80. 
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Tangent  lines  and  planes  to  quadric  surface, 

true  and  pseudo,  119,  120. 
Ternary  form,  5 ; 

symmetric  function,  257. 
Transformation,  affiue,  70; 

determinant  and  matrix  of,  66; 

elementary  (of  a  matrix),  55,  262; 

identical,  67; 

inverse,  67; 

linear,  66 ; 

orthogonal,  154,  173,  304; 

projective,  69; 

singular,  67. 
Transposed  matrix,  21. 


Unimodular  matrix,  83. 
Unit  matrix,  74. 


Vertex  of  a  cone,  120,  122,  123,  156; 
of  a  quadratic  form,  129. 


Weierstrass,  262,  270,  314. 
Weight  of  an  invariant,  96,  225; 

of  a  covarlant,  97,  226 ; 

of  a  polynomial,  222 ; 

of  a  symmetric  polynomial,  245,  253. 
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